Overview
Brought to you by YData
Dataset statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Number of variables | 37 | 24 |
| Number of observations | 21881 | 22588 |
| Missing cells | 344985 | 233282 |
| Missing cells (%) | 42.6% | 43.0% |
| Duplicate rows | 0 | 0 |
| Duplicate rows (%) | 0.0% | 0.0% |
| Total size in memory | 6.2 MiB | 4.1 MiB |
| Average record size in memory | 296.0 B | 192.0 B |
Variable types
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Text | 9 | 7 |
| Numeric | 3 | 3 |
| Categorical | 23 | 12 |
| Boolean | 2 | 2 |
| curated_md_report | concatenated_md_report | |
|---|---|---|
age_group is highly overall correlated with age_group_ontology_term_id and 4 other fields | age_group is highly overall correlated with age_max and 3 other fields | High correlation |
age_group_ontology_term_id is highly overall correlated with age_group and 4 other fields | Alert not present in this dataset | High correlation |
age_max is highly overall correlated with age_group and 5 other fields | age_max is highly overall correlated with age_group and 2 other fields | High correlation |
age_min is highly overall correlated with age_group and 5 other fields | age_min is highly overall correlated with age_group and 2 other fields | High correlation |
age_years is highly overall correlated with age_group and 7 other fields | age_years is highly overall correlated with age_group and 3 other fields | High correlation |
antibiotics_current_use is highly overall correlated with fmt_id and 3 other fields | antibiotics_current_use is highly overall correlated with fmt_id and 2 other fields | High correlation |
body_site is highly overall correlated with body_site_ontology_term_id and 9 other fields | body_site is highly overall correlated with dietary_restriction and 4 other fields | High correlation |
body_site_ontology_term_id is highly overall correlated with body_site and 9 other fields | Alert not present in this dataset | High correlation |
control is highly overall correlated with control_ontology_term_id and 5 other fields | control is highly overall correlated with fmt_id and 3 other fields | High correlation |
control_ontology_term_id is highly overall correlated with control and 5 other fields | Alert not present in this dataset | High correlation |
country is highly overall correlated with country_ontology_term_id and 10 other fields | country is highly overall correlated with dietary_restriction and 5 other fields | High correlation |
country_ontology_term_id is highly overall correlated with country and 10 other fields | Alert not present in this dataset | High correlation |
dietary_restriction is highly overall correlated with body_site and 9 other fields | dietary_restriction is highly overall correlated with body_site and 5 other fields | High correlation |
feces_phenotype_metric is highly overall correlated with age_years and 10 other fields | feces_phenotype_metric is highly overall correlated with age_years and 5 other fields | High correlation |
feces_phenotype_metric_ontology_term_id is highly overall correlated with age_years and 10 other fields | Alert not present in this dataset | High correlation |
fmt_id is highly overall correlated with age_group and 13 other fields | fmt_id is highly overall correlated with age_group and 8 other fields | High correlation |
fmt_role is highly overall correlated with body_site and 6 other fields | Alert not present in this dataset | High correlation |
hla is highly overall correlated with age_max and 10 other fields | Alert not present in this dataset | High correlation |
hla_ontology_term_id is highly overall correlated with age_max and 10 other fields | Alert not present in this dataset | High correlation |
sex is highly overall correlated with hla and 2 other fields | Alert not present in this dataset | High correlation |
sex_ontology_term_id is highly overall correlated with hla and 2 other fields | Alert not present in this dataset | High correlation |
smoker is highly overall correlated with country and 7 other fields | smoker is highly overall correlated with country and 3 other fields | High correlation |
smoker_ontology_term_id is highly overall correlated with country and 7 other fields | Alert not present in this dataset | High correlation |
target_condition is highly overall correlated with antibiotics_current_use and 15 other fields | target_condition is highly overall correlated with antibiotics_current_use and 7 other fields | High correlation |
target_condition_ontology_term_id is highly overall correlated with antibiotics_current_use and 15 other fields | Alert not present in this dataset | High correlation |
tumor_staging_ajcc is highly overall correlated with body_site and 6 other fields | tumor_staging_ajcc is highly overall correlated with body_site and 4 other fields | High correlation |
tumor_staging_tnm is highly overall correlated with antibiotics_current_use and 6 other fields | tumor_staging_tnm is highly overall correlated with antibiotics_current_use and 4 other fields | High correlation |
westernized is highly overall correlated with country and 12 other fields | westernized is highly overall correlated with country and 6 other fields | High correlation |
body_site is highly imbalanced (81.2%) | body_site is highly imbalanced (81.7%) | Imbalance |
body_site_ontology_term_id is highly imbalanced (81.2%) | Alert not present in this dataset | Imbalance |
westernized is highly imbalanced (68.3%) | westernized is highly imbalanced (69.0%) | Imbalance |
age_years has 8409 (38.4%) missing values | age_years has 9035 (40.0%) missing values | Missing |
biomarker has 18841 (86.1%) missing values | biomarker has 19548 (86.5%) missing values | Missing |
dietary_restriction has 21464 (98.1%) missing values | dietary_restriction has 22171 (98.2%) missing values | Missing |
feces_phenotype_metric has 20784 (95.0%) missing values | feces_phenotype_metric has 21491 (95.1%) missing values | Missing |
feces_phenotype_value has 20784 (95.0%) missing values | feces_phenotype_value has 21491 (95.1%) missing values | Missing |
feces_phenotype_metric_ontology_term_id has 20784 (95.0%) missing values | Alert not present in this dataset | Missing |
fmt_role has 21725 (99.3%) missing values | Alert not present in this dataset | Missing |
fmt_id has 21736 (99.3%) missing values | fmt_id has 22443 (99.4%) missing values | Missing |
sex has 2558 (11.7%) missing values | sex has 2558 (11.3%) missing values | Missing |
sex_ontology_term_id has 2558 (11.7%) missing values | Alert not present in this dataset | Missing |
hla has 20981 (95.9%) missing values | Alert not present in this dataset | Missing |
hla_ontology_term_id has 20981 (95.9%) missing values | Alert not present in this dataset | Missing |
smoker has 18901 (86.4%) missing values | smoker has 19608 (86.8%) missing values | Missing |
smoker_ontology_term_id has 18901 (86.4%) missing values | Alert not present in this dataset | Missing |
antibiotics_current_use has 7306 (33.4%) missing values | antibiotics_current_use has 7932 (35.1%) missing values | Missing |
treatment has 19534 (89.3%) missing values | treatment has 20241 (89.6%) missing values | Missing |
treatment_ontology_term_id has 16053 (73.4%) missing values | Alert not present in this dataset | Missing |
tumor_staging_ajcc has 21252 (97.1%) missing values | tumor_staging_ajcc has 21959 (97.2%) missing values | Missing |
tumor_staging_tnm has 21619 (98.8%) missing values | tumor_staging_tnm has 22326 (98.8%) missing values | Missing |
unmetadata has 19810 (90.5%) missing values | unmetadata has 20517 (90.8%) missing values | Missing |
| Alert not present in this dataset | age_min has 627 (2.8%) missing values | Missing |
| Alert not present in this dataset | age_max has 627 (2.8%) missing values | Missing |
| Alert not present in this dataset | control has 707 (3.1%) missing values | Missing |
Reproduction
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Analysis started | 2025-03-31 03:31:04.154718 | 2025-03-31 03:31:10.895815 |
| Analysis finished | 2025-03-31 03:31:10.887266 | 2025-03-31 03:31:14.891206 |
| Duration | 6.73 seconds | 4 seconds |
| Software version | ydata-profiling vv4.16.1 | ydata-profiling vv4.16.1 |
| Download configuration | config.json | config.json |
Variables
study_name
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 90 | 93 |
| Distinct (%) | 0.4% | 0.4% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 22 | 22 |
| Median length | 19 | 19 |
| Mean length | 12.773411 | 12.756242 |
| Min length | 8 | 8 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | AsnicarF_2017 | AsnicarF_2017 |
| 2nd row | AsnicarF_2017 | AsnicarF_2017 |
| 3rd row | AsnicarF_2017 | AsnicarF_2017 |
| 4th row | AsnicarF_2017 | AsnicarF_2017 |
| 5th row | AsnicarF_2017 | AsnicarF_2017 |
| Value | Count | Frequency (%) |
| metacardis_2020_a | 1831 | 8.4% |
| shaoy_2019 | 1644 | 7.5% |
| hmp_2019_ibdmdb | 1627 | 7.4% |
| lifelinesdeep_2016 | 1135 | 5.2% |
| asnicarf_2021 | 1098 | 5.0% |
| mehtars_2018 | 928 | 4.2% |
| zeevid_2015 | 900 | 4.1% |
| vatanent_2016 | 785 | 3.6% |
| hmp_2012 | 748 | 3.4% |
| yachidas_2019 | 616 | 2.8% |
| Other values (80) | 10569 |
| Value | Count | Frequency (%) |
| metacardis_2020_a | 1831 | 8.1% |
| shaoy_2019 | 1644 | 7.3% |
| hmp_2019_ibdmdb | 1627 | 7.2% |
| lifelinesdeep_2016 | 1135 | 5.0% |
| asnicarf_2021 | 1098 | 4.9% |
| mehtars_2018 | 928 | 4.1% |
| zeevid_2015 | 900 | 4.0% |
| vatanent_2016 | 785 | 3.5% |
| hmp_2012 | 748 | 3.3% |
| yachidas_2019 | 616 | 2.7% |
| Other values (83) | 11276 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 27391 | 9.8% |
| _ | 25877 | 9.3% |
| 0 | 24226 | 8.7% |
| 1 | 19206 | 6.9% |
| e | 16920 | 6.1% |
| a | 16355 | 5.9% |
| i | 13607 | 4.9% |
| n | 7790 | 2.8% |
| s | 7329 | 2.6% |
| r | 6667 | 2.4% |
| Other values (51) | 114127 |
| Value | Count | Frequency (%) |
| 2 | 28098 | 9.8% |
| _ | 26584 | 9.2% |
| 0 | 24933 | 8.7% |
| 1 | 19913 | 6.9% |
| a | 17143 | 5.9% |
| e | 16920 | 5.9% |
| i | 14043 | 4.9% |
| n | 8033 | 2.8% |
| s | 7871 | 2.7% |
| r | 6938 | 2.4% |
| Other values (51) | 117662 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 279495 |
| Value | Count | Frequency (%) |
| (unknown) | 288138 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 27391 | 9.8% |
| _ | 25877 | 9.3% |
| 0 | 24226 | 8.7% |
| 1 | 19206 | 6.9% |
| e | 16920 | 6.1% |
| a | 16355 | 5.9% |
| i | 13607 | 4.9% |
| n | 7790 | 2.8% |
| s | 7329 | 2.6% |
| r | 6667 | 2.4% |
| Other values (51) | 114127 |
| Value | Count | Frequency (%) |
| 2 | 28098 | 9.8% |
| _ | 26584 | 9.2% |
| 0 | 24933 | 8.7% |
| 1 | 19913 | 6.9% |
| a | 17143 | 5.9% |
| e | 16920 | 5.9% |
| i | 14043 | 4.9% |
| n | 8033 | 2.8% |
| s | 7871 | 2.7% |
| r | 6938 | 2.4% |
| Other values (51) | 117662 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 279495 |
| Value | Count | Frequency (%) |
| (unknown) | 288138 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 27391 | 9.8% |
| _ | 25877 | 9.3% |
| 0 | 24226 | 8.7% |
| 1 | 19206 | 6.9% |
| e | 16920 | 6.1% |
| a | 16355 | 5.9% |
| i | 13607 | 4.9% |
| n | 7790 | 2.8% |
| s | 7329 | 2.6% |
| r | 6667 | 2.4% |
| Other values (51) | 114127 |
| Value | Count | Frequency (%) |
| 2 | 28098 | 9.8% |
| _ | 26584 | 9.2% |
| 0 | 24933 | 8.7% |
| 1 | 19913 | 6.9% |
| a | 17143 | 5.9% |
| e | 16920 | 5.9% |
| i | 14043 | 4.9% |
| n | 8033 | 2.8% |
| s | 7871 | 2.7% |
| r | 6938 | 2.4% |
| Other values (51) | 117662 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 279495 |
| Value | Count | Frequency (%) |
| (unknown) | 288138 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 27391 | 9.8% |
| _ | 25877 | 9.3% |
| 0 | 24226 | 8.7% |
| 1 | 19206 | 6.9% |
| e | 16920 | 6.1% |
| a | 16355 | 5.9% |
| i | 13607 | 4.9% |
| n | 7790 | 2.8% |
| s | 7329 | 2.6% |
| r | 6667 | 2.4% |
| Other values (51) | 114127 |
| Value | Count | Frequency (%) |
| 2 | 28098 | 9.8% |
| _ | 26584 | 9.2% |
| 0 | 24933 | 8.7% |
| 1 | 19913 | 6.9% |
| a | 17143 | 5.9% |
| e | 16920 | 5.9% |
| i | 14043 | 4.9% |
| n | 8033 | 2.8% |
| s | 7871 | 2.7% |
| r | 6938 | 2.4% |
| Other values (51) | 117662 |
sample_id
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 21704 | 22411 |
| Distinct (%) | 99.2% | 99.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 57 | 57 |
| Median length | 54 | 54 |
| Mean length | 14.422878 | 14.524172 |
| Min length | 2 | 2 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 21527 | 22234 ? |
| Unique (%) | 98.4% | 98.4% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | MV_FEI1_t1Q14 | MV_FEI1_t1Q14 |
| 2nd row | MV_FEI2_t1Q14 | MV_FEI2_t1Q14 |
| 3rd row | MV_FEI3_t1Q14 | MV_FEI3_t1Q14 |
| 4th row | MV_FEI4_t1Q14 | MV_FEI4_t1Q14 |
| 5th row | MV_FEI4_t2Q15 | MV_FEI4_t2Q15 |
| Value | Count | Frequency (%) |
| mh0039 | 2 | < 0.1% |
| mh0081 | 2 | < 0.1% |
| mh0059 | 2 | < 0.1% |
| mh0048 | 2 | < 0.1% |
| mh0126 | 2 | < 0.1% |
| mh0040 | 2 | < 0.1% |
| mh0041 | 2 | < 0.1% |
| mh0042 | 2 | < 0.1% |
| mh0043 | 2 | < 0.1% |
| mh0044 | 2 | < 0.1% |
| Other values (21694) | 21861 |
| Value | Count | Frequency (%) |
| mh0132 | 2 | < 0.1% |
| mh0127 | 2 | < 0.1% |
| mh0143 | 2 | < 0.1% |
| mh0144 | 2 | < 0.1% |
| mh0145 | 2 | < 0.1% |
| mh0146 | 2 | < 0.1% |
| mh0148 | 2 | < 0.1% |
| mh0139 | 2 | < 0.1% |
| mh0149 | 2 | < 0.1% |
| mh0079 | 2 | < 0.1% |
| Other values (22401) | 22568 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 40630 | 12.9% |
| 1 | 26414 | 8.4% |
| 2 | 16856 | 5.3% |
| 6 | 15293 | 4.8% |
| 4 | 14314 | 4.5% |
| 9 | 14047 | 4.5% |
| S | 13725 | 4.3% |
| M | 13660 | 4.3% |
| 7 | 13500 | 4.3% |
| 3 | 12757 | 4.0% |
| Other values (55) | 134391 |
| Value | Count | Frequency (%) |
| 0 | 44533 | 13.6% |
| 1 | 27806 | 8.5% |
| 2 | 17408 | 5.3% |
| 6 | 16037 | 4.9% |
| 4 | 14743 | 4.5% |
| 9 | 14265 | 4.3% |
| 7 | 14162 | 4.3% |
| M | 13741 | 4.2% |
| S | 13725 | 4.2% |
| 3 | 13528 | 4.1% |
| Other values (55) | 138124 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 315587 |
| Value | Count | Frequency (%) |
| (unknown) | 328072 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 40630 | 12.9% |
| 1 | 26414 | 8.4% |
| 2 | 16856 | 5.3% |
| 6 | 15293 | 4.8% |
| 4 | 14314 | 4.5% |
| 9 | 14047 | 4.5% |
| S | 13725 | 4.3% |
| M | 13660 | 4.3% |
| 7 | 13500 | 4.3% |
| 3 | 12757 | 4.0% |
| Other values (55) | 134391 |
| Value | Count | Frequency (%) |
| 0 | 44533 | 13.6% |
| 1 | 27806 | 8.5% |
| 2 | 17408 | 5.3% |
| 6 | 16037 | 4.9% |
| 4 | 14743 | 4.5% |
| 9 | 14265 | 4.3% |
| 7 | 14162 | 4.3% |
| M | 13741 | 4.2% |
| S | 13725 | 4.2% |
| 3 | 13528 | 4.1% |
| Other values (55) | 138124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 315587 |
| Value | Count | Frequency (%) |
| (unknown) | 328072 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 40630 | 12.9% |
| 1 | 26414 | 8.4% |
| 2 | 16856 | 5.3% |
| 6 | 15293 | 4.8% |
| 4 | 14314 | 4.5% |
| 9 | 14047 | 4.5% |
| S | 13725 | 4.3% |
| M | 13660 | 4.3% |
| 7 | 13500 | 4.3% |
| 3 | 12757 | 4.0% |
| Other values (55) | 134391 |
| Value | Count | Frequency (%) |
| 0 | 44533 | 13.6% |
| 1 | 27806 | 8.5% |
| 2 | 17408 | 5.3% |
| 6 | 16037 | 4.9% |
| 4 | 14743 | 4.5% |
| 9 | 14265 | 4.3% |
| 7 | 14162 | 4.3% |
| M | 13741 | 4.2% |
| S | 13725 | 4.2% |
| 3 | 13528 | 4.1% |
| Other values (55) | 138124 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 315587 |
| Value | Count | Frequency (%) |
| (unknown) | 328072 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 40630 | 12.9% |
| 1 | 26414 | 8.4% |
| 2 | 16856 | 5.3% |
| 6 | 15293 | 4.8% |
| 4 | 14314 | 4.5% |
| 9 | 14047 | 4.5% |
| S | 13725 | 4.3% |
| M | 13660 | 4.3% |
| 7 | 13500 | 4.3% |
| 3 | 12757 | 4.0% |
| Other values (55) | 134391 |
| Value | Count | Frequency (%) |
| 0 | 44533 | 13.6% |
| 1 | 27806 | 8.5% |
| 2 | 17408 | 5.3% |
| 6 | 16037 | 4.9% |
| 4 | 14743 | 4.5% |
| 9 | 14265 | 4.3% |
| 7 | 14162 | 4.3% |
| M | 13741 | 4.2% |
| S | 13725 | 4.2% |
| 3 | 13528 | 4.1% |
| Other values (55) | 138124 |
age_years
Real number (ℝ)
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 464 | 465 |
| Distinct (%) | 3.4% | 3.4% |
| Missing | 8409 | 9035 |
| Missing (%) | 38.4% | 40.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 32.742141 | 32.896563 |
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 92 | 92 |
| Zeros | 43 | 43 |
| Zeros (%) | 0.2% | 0.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Quantile statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.019178082 | 0.019178082 |
| Q1 | 6 | 6 |
| median | 32 | 32 |
| Q3 | 54 | 54 |
| 95-th percentile | 71 | 71 |
| Maximum | 92 | 92 |
| Range | 92 | 92 |
| Interquartile range (IQR) | 48 | 48 |
Descriptive statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Standard deviation | 24.79753 | 24.817257 |
| Coefficient of variation (CV) | 0.75735823 | 0.75440274 |
| Kurtosis | -1.2321976 | -1.2354848 |
| Mean | 32.742141 | 32.896563 |
| Median Absolute Deviation (MAD) | 23 | 23 |
| Skewness | 0.093306608 | 0.08417056 |
| Sum | 441102.12 | 445847.12 |
| Variance | 614.91748 | 615.89626 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01917808219 | 539 | 2.5% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 280 | 1.3% |
| 23 | 243 | 1.1% |
| 26 | 232 | 1.1% |
| 32 | 231 | 1.1% |
| 27 | 226 | 1.0% |
| 50 | 224 | 1.0% |
| Other values (454) | 10428 | |
| (Missing) | 8409 |
| Value | Count | Frequency (%) |
| 0.01917808219 | 539 | 2.4% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 287 | 1.3% |
| 23 | 243 | 1.1% |
| 26 | 232 | 1.0% |
| 32 | 231 | 1.0% |
| 50 | 226 | 1.0% |
| 27 | 226 | 1.0% |
| Other values (455) | 10500 | |
| (Missing) | 9035 |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
age_min
Real number (ℝ)
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 464 | 465 |
| Distinct (%) | 2.1% | 2.1% |
| Missing | 1 | 627 |
| Missing (%) | < 0.1% | 2.8% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 28.489357 | 28.600342 |
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 92 | 92 |
| Zeros | 187 | 187 |
| Zeros (%) | 0.9% | 0.8% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Quantile statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.019178082 | 0.019178082 |
| Q1 | 18 | 18 |
| median | 18 | 18 |
| Q3 | 46 | 46 |
| 95-th percentile | 69 | 69 |
| Maximum | 92 | 92 |
| Range | 92 | 92 |
| Interquartile range (IQR) | 28 | 28 |
Descriptive statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Standard deviation | 21.920538 | 21.965644 |
| Coefficient of variation (CV) | 0.76942902 | 0.7680203 |
| Kurtosis | -0.6680911 | -0.68487874 |
| Mean | 28.489357 | 28.600342 |
| Median Absolute Deviation (MAD) | 14 | 14 |
| Skewness | 0.64477406 | 0.63645516 |
| Sum | 623347.12 | 628092.12 |
| Variance | 480.50998 | 482.4895 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 7465 | |
| 65 | 922 | 4.2% |
| 0.01917808219 | 539 | 2.5% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 280 | 1.3% |
| 23 | 243 | 1.1% |
| 26 | 232 | 1.1% |
| 32 | 231 | 1.1% |
| Other values (454) | 10899 |
| Value | Count | Frequency (%) |
| 18 | 7465 | |
| 65 | 926 | 4.1% |
| 0.01917808219 | 539 | 2.4% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 287 | 1.3% |
| 23 | 243 | 1.1% |
| 26 | 232 | 1.0% |
| 32 | 231 | 1.0% |
| Other values (455) | 10969 | |
| (Missing) | 627 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.9% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.8% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.9% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.8% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
age_max
Real number (ℝ)
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 465 | 466 |
| Distinct (%) | 2.1% | 2.1% |
| Missing | 1 | 627 |
| Missing (%) | < 0.1% | 2.8% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 46.680581 | 46.724472 |
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 130 | 130 |
| Zeros | 43 | 43 |
| Zeros (%) | 0.2% | 0.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Quantile statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.030136986 | 0.030136986 |
| Q1 | 24 | 24 |
| median | 57 | 57 |
| Q3 | 65 | 65 |
| 95-th percentile | 75 | 75 |
| Maximum | 130 | 130 |
| Range | 130 | 130 |
| Interquartile range (IQR) | 41 | 41 |
Descriptive statistics
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Standard deviation | 29.434893 | 29.396631 |
| Coefficient of variation (CV) | 0.63055969 | 0.62914849 |
| Kurtosis | 0.3773218 | 0.38222039 |
| Mean | 46.680581 | 46.724472 |
| Median Absolute Deviation (MAD) | 11 | 11 |
| Skewness | 0.16965504 | 0.16608084 |
| Sum | 1021371.1 | 1026116.1 |
| Variance | 866.41293 | 864.16191 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 7595 | |
| 130 | 743 | 3.4% |
| 0.01917808219 | 539 | 2.5% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 280 | 1.3% |
| 23 | 243 | 1.1% |
| 2 | 239 | 1.1% |
| 26 | 232 | 1.1% |
| Other values (455) | 10940 |
| Value | Count | Frequency (%) |
| 65 | 7599 | |
| 130 | 743 | 3.3% |
| 0.01917808219 | 539 | 2.4% |
| 1 | 418 | 1.9% |
| 0.05753424658 | 335 | 1.5% |
| 0.01095890411 | 316 | 1.4% |
| 51 | 287 | 1.3% |
| 23 | 243 | 1.1% |
| 2 | 239 | 1.1% |
| 26 | 232 | 1.0% |
| Other values (456) | 11010 | |
| (Missing) | 627 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.2% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 43 | 0.2% |
| 0.002739726027 | 21 | 0.1% |
| 0.005479452055 | 24 | 0.1% |
| 0.008219178082 | 33 | 0.1% |
| 0.01095890411 | 316 | |
| 0.01369863014 | 29 | 0.1% |
| 0.01643835616 | 12 | 0.1% |
| 0.01917808219 | 539 | |
| 0.02191780822 | 26 | 0.1% |
| 0.02465753425 | 22 | 0.1% |
age_group
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 1 | 1 |
| Missing (%) | < 0.1% | < 0.1% |
| Memory size | 171.1 KiB | 176.6 KiB |
| Adult | |
|---|---|
| Infant | |
| Elderly | |
| Adolescent | 760 |
| Children 2-11 Years Old | 567 |
| Adult | |
|---|---|
| Infant | |
| Elderly | |
| Adolescent | 760 |
| Children 2-11 Years Old | 567 |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 23 | 23 |
| Median length | 5 | 5 |
| Mean length | 6.0051188 | 5.9824678 |
| Min length | 5 | 5 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Infant | Infant |
| 2nd row | Infant | Infant |
| 3rd row | Infant | Infant |
| 4th row | Infant | Infant |
| 5th row | Infant | Infant |
Common Values
| Value | Count | Frequency (%) |
| Adult | 14924 | |
| Infant | 3272 | 15.0% |
| Elderly | 2357 | 10.8% |
| Adolescent | 760 | 3.5% |
| Children 2-11 Years Old | 567 | 2.6% |
| (Missing) | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| Adult | 15454 | |
| Infant | 3427 | 15.2% |
| Elderly | 2379 | 10.5% |
| Adolescent | 760 | 3.4% |
| Children 2-11 Years Old | 567 | 2.5% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| adult | 14924 | |
| infant | 3272 | 13.9% |
| elderly | 2357 | 10.0% |
| adolescent | 760 | 3.2% |
| children | 567 | 2.4% |
| 2-11 | 567 | 2.4% |
| years | 567 | 2.4% |
| old | 567 | 2.4% |
| Value | Count | Frequency (%) |
| adult | 15454 | |
| infant | 3427 | 14.1% |
| elderly | 2379 | 9.8% |
| adolescent | 760 | 3.1% |
| children | 567 | 2.3% |
| 2-11 | 567 | 2.3% |
| years | 567 | 2.3% |
| old | 567 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 21532 | |
| d | 19175 | |
| t | 18956 | |
| A | 15684 | |
| u | 14924 | |
| n | 7871 | 6.0% |
| e | 5011 | 3.8% |
| a | 3839 | 2.9% |
| r | 3491 | 2.7% |
| I | 3272 | 2.5% |
| Other values (15) | 17637 |
| Value | Count | Frequency (%) |
| l | 22106 | |
| d | 19727 | |
| t | 19641 | |
| A | 16214 | |
| u | 15454 | |
| n | 8181 | 6.1% |
| e | 5033 | 3.7% |
| a | 3994 | 3.0% |
| r | 3513 | 2.6% |
| I | 3427 | 2.5% |
| Other values (15) | 17836 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 131392 |
| Value | Count | Frequency (%) |
| (unknown) | 135126 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 21532 | |
| d | 19175 | |
| t | 18956 | |
| A | 15684 | |
| u | 14924 | |
| n | 7871 | 6.0% |
| e | 5011 | 3.8% |
| a | 3839 | 2.9% |
| r | 3491 | 2.7% |
| I | 3272 | 2.5% |
| Other values (15) | 17637 |
| Value | Count | Frequency (%) |
| l | 22106 | |
| d | 19727 | |
| t | 19641 | |
| A | 16214 | |
| u | 15454 | |
| n | 8181 | 6.1% |
| e | 5033 | 3.7% |
| a | 3994 | 3.0% |
| r | 3513 | 2.6% |
| I | 3427 | 2.5% |
| Other values (15) | 17836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 131392 |
| Value | Count | Frequency (%) |
| (unknown) | 135126 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 21532 | |
| d | 19175 | |
| t | 18956 | |
| A | 15684 | |
| u | 14924 | |
| n | 7871 | 6.0% |
| e | 5011 | 3.8% |
| a | 3839 | 2.9% |
| r | 3491 | 2.7% |
| I | 3272 | 2.5% |
| Other values (15) | 17637 |
| Value | Count | Frequency (%) |
| l | 22106 | |
| d | 19727 | |
| t | 19641 | |
| A | 16214 | |
| u | 15454 | |
| n | 8181 | 6.1% |
| e | 5033 | 3.7% |
| a | 3994 | 3.0% |
| r | 3513 | 2.6% |
| I | 3427 | 2.5% |
| Other values (15) | 17836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 131392 |
| Value | Count | Frequency (%) |
| (unknown) | 135126 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 21532 | |
| d | 19175 | |
| t | 18956 | |
| A | 15684 | |
| u | 14924 | |
| n | 7871 | 6.0% |
| e | 5011 | 3.8% |
| a | 3839 | 2.9% |
| r | 3491 | 2.7% |
| I | 3272 | 2.5% |
| Other values (15) | 17637 |
| Value | Count | Frequency (%) |
| l | 22106 | |
| d | 19727 | |
| t | 19641 | |
| A | 16214 | |
| u | 15454 | |
| n | 8181 | 6.1% |
| e | 5033 | 3.7% |
| a | 3994 | 3.0% |
| r | 3513 | 2.6% |
| I | 3427 | 2.5% |
| Other values (15) | 17836 |
age_group_ontology_term_id
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 171.1 KiB |
| NCIT:C49685 | |
|---|---|
| NCIT:C27956 | |
| NCIT:C16268 | |
| NCIT:C27954 | 760 |
| NCIT:C49683 | 567 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NCIT:C27956 |
|---|---|
| 2nd row | NCIT:C27956 |
| 3rd row | NCIT:C27956 |
| 4th row | NCIT:C27956 |
| 5th row | NCIT:C27956 |
Common Values
| Value | Count | Frequency (%) |
| NCIT:C49685 | 14924 | |
| NCIT:C27956 | 3272 | 15.0% |
| NCIT:C16268 | 2357 | 10.8% |
| NCIT:C27954 | 760 | 3.5% |
| NCIT:C49683 | 567 | 2.6% |
| (Missing) | 1 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| ncit:c49685 | 14924 | |
| ncit:c27956 | 3272 | 15.0% |
| ncit:c16268 | 2357 | 10.8% |
| ncit:c27954 | 760 | 3.5% |
| ncit:c49683 | 567 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 43760 | |
| 6 | 23477 | |
| N | 21880 | |
| I | 21880 | |
| T | 21880 | |
| : | 21880 | |
| 9 | 19523 | |
| 5 | 18956 | |
| 8 | 17848 | |
| 4 | 16251 | 6.8% |
| Other values (4) | 13345 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 240680 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 43760 | |
| 6 | 23477 | |
| N | 21880 | |
| I | 21880 | |
| T | 21880 | |
| : | 21880 | |
| 9 | 19523 | |
| 5 | 18956 | |
| 8 | 17848 | |
| 4 | 16251 | 6.8% |
| Other values (4) | 13345 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 240680 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 43760 | |
| 6 | 23477 | |
| N | 21880 | |
| I | 21880 | |
| T | 21880 | |
| : | 21880 | |
| 9 | 19523 | |
| 5 | 18956 | |
| 8 | 17848 | |
| 4 | 16251 | 6.8% |
| Other values (4) | 13345 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 240680 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 43760 | |
| 6 | 23477 | |
| N | 21880 | |
| I | 21880 | |
| T | 21880 | |
| : | 21880 | |
| 9 | 19523 | |
| 5 | 18956 | |
| 8 | 17848 | |
| 4 | 16251 | 6.8% |
| Other values (4) | 13345 | 5.5% |
biomarker
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 2805 | 2805 |
| Distinct (%) | 92.3% | 92.3% |
| Missing | 18841 | 19548 |
| Missing (%) | 86.1% | 86.5% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 628 | 628 |
| Median length | 616 | 616 |
| Mean length | 198.78684 | 198.78684 |
| Min length | 25 | 25 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 2702 | 2702 ? |
| Unique (%) | 88.9% | 88.9% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Alanine_Aminotransferase_in_U/L:45;Albumin_in_g/dL:49;Aspartate_Aminotransferase_in_U/L:34;Creatine_in_umol/L:56;Erythrocyte_Sedimentation_Rate_in_mm/hr:4;Globulin_Protein_in_g/L:32;High_Sensitivity_C-Reactive_Protein_in_mg/L:5;Urea_Nitrogen_in_mmol/L:4.1 | Alanine_Aminotransferase_in_U/L:45;Albumin_in_g/dL:49;Aspartate_Aminotransferase_in_U/L:34;Creatine_in_umol/L:56;Erythrocyte_Sedimentation_Rate_in_mm/hr:4;Globulin_Protein_in_g/L:32;High_Sensitivity_C-Reactive_Protein_in_mg/L:5;Urea_Nitrogen_in_mmol/L:4.1 |
| 2nd row | Alanine_Aminotransferase_in_U/L:54;Albumin_in_g/dL:44.3;Aspartate_Aminotransferase_in_U/L:36;Creatine_in_umol/L:96;Erythrocyte_Sedimentation_Rate_in_mm/hr:3;Globulin_Protein_in_g/L:20.8;High_Sensitivity_C-Reactive_Protein_in_mg/L:8;Urea_Nitrogen_in_mmol/L:6.87 | Alanine_Aminotransferase_in_U/L:54;Albumin_in_g/dL:44.3;Aspartate_Aminotransferase_in_U/L:36;Creatine_in_umol/L:96;Erythrocyte_Sedimentation_Rate_in_mm/hr:3;Globulin_Protein_in_g/L:20.8;High_Sensitivity_C-Reactive_Protein_in_mg/L:8;Urea_Nitrogen_in_mmol/L:6.87 |
| 3rd row | Alanine_Aminotransferase_in_U/L:34;Albumin_in_g/dL:49;Aspartate_Aminotransferase_in_U/L:21;Creatine_in_umol/L:75;Erythrocyte_Sedimentation_Rate_in_mm/hr:24;Globulin_Protein_in_g/L:18.9;High_Sensitivity_C-Reactive_Protein_in_mg/L:8;Urea_Nitrogen_in_mmol/L:3.78 | Alanine_Aminotransferase_in_U/L:34;Albumin_in_g/dL:49;Aspartate_Aminotransferase_in_U/L:21;Creatine_in_umol/L:75;Erythrocyte_Sedimentation_Rate_in_mm/hr:24;Globulin_Protein_in_g/L:18.9;High_Sensitivity_C-Reactive_Protein_in_mg/L:8;Urea_Nitrogen_in_mmol/L:3.78 |
| 4th row | Alanine_Aminotransferase_in_U/L:22;Albumin_in_g/dL:40.1;Aspartate_Aminotransferase_in_U/L:29;Creatine_in_umol/L:64;Erythrocyte_Sedimentation_Rate_in_mm/hr:11;Globulin_Protein_in_g/L:31.9;High_Sensitivity_C-Reactive_Protein_in_mg/L:2.3;Urea_Nitrogen_in_mmol/L:4.01 | Alanine_Aminotransferase_in_U/L:22;Albumin_in_g/dL:40.1;Aspartate_Aminotransferase_in_U/L:29;Creatine_in_umol/L:64;Erythrocyte_Sedimentation_Rate_in_mm/hr:11;Globulin_Protein_in_g/L:31.9;High_Sensitivity_C-Reactive_Protein_in_mg/L:2.3;Urea_Nitrogen_in_mmol/L:4.01 |
| 5th row | Alanine_Aminotransferase_in_U/L:18;Albumin_in_g/dL:41.6;Aspartate_Aminotransferase_in_U/L:18;Creatine_in_umol/L:80.4;Erythrocyte_Sedimentation_Rate_in_mm/hr:18;Globulin_Protein_in_g/L:26.6;High_Sensitivity_C-Reactive_Protein_in_mg/L:3.9;Urea_Nitrogen_in_mmol/L:5.53 | Alanine_Aminotransferase_in_U/L:18;Albumin_in_g/dL:41.6;Aspartate_Aminotransferase_in_U/L:18;Creatine_in_umol/L:80.4;Erythrocyte_Sedimentation_Rate_in_mm/hr:18;Globulin_Protein_in_g/L:26.6;High_Sensitivity_C-Reactive_Protein_in_mg/L:3.9;Urea_Nitrogen_in_mmol/L:5.53 |
| Value | Count | Frequency (%) |
| diastolic_blood_pressure_in_mm/hg:80;systolic_blood_pressure_in_mm/hg:120 | 29 | 1.0% |
| autoantibody_titer_measurement_(procedure):iaa;gada;ia-2a;znt8a;ica | 28 | 0.9% |
| diastolic_blood_pressure_in_mm/hg:70;systolic_blood_pressure_in_mm/hg:110 | 13 | 0.4% |
| autoantibody_titer_measurement_(procedure):iaa;gada | 12 | 0.4% |
| autoantibody_titer_measurement_(procedure):iaa;gada;znt8a;ica | 10 | 0.3% |
| autoantibody_titer_measurement_(procedure):iaa;ia-2a;znt8a;ica | 9 | 0.3% |
| autoantibody_titer_measurement_(procedure):iaa;gada;ia-2a;ica | 7 | 0.2% |
| cholesterol_in_mg/dl:211.1382;creatinine_in_umol/l:80.19;high_density_lipoprotein_cholesterol_in_mg/dl:51.0444;ldl_particles_in_mg/dl:128.3844;triglyceride_in_mg/dl:158.5403 | 7 | 0.2% |
| autoantibody_titer_measurement_(procedure):iaa;ica | 7 | 0.2% |
| diastolic_blood_pressure_in_mm/hg:60;systolic_blood_pressure_in_mm/hg:100 | 5 | 0.2% |
| Other values (2795) | 2913 |
| Value | Count | Frequency (%) |
| diastolic_blood_pressure_in_mm/hg:80;systolic_blood_pressure_in_mm/hg:120 | 29 | 1.0% |
| autoantibody_titer_measurement_(procedure):iaa;gada;ia-2a;znt8a;ica | 28 | 0.9% |
| diastolic_blood_pressure_in_mm/hg:70;systolic_blood_pressure_in_mm/hg:110 | 13 | 0.4% |
| autoantibody_titer_measurement_(procedure):iaa;gada | 12 | 0.4% |
| autoantibody_titer_measurement_(procedure):iaa;gada;znt8a;ica | 10 | 0.3% |
| autoantibody_titer_measurement_(procedure):iaa;ia-2a;znt8a;ica | 9 | 0.3% |
| autoantibody_titer_measurement_(procedure):iaa;gada;ia-2a;ica | 7 | 0.2% |
| cholesterol_in_mg/dl:211.1382;creatinine_in_umol/l:80.19;high_density_lipoprotein_cholesterol_in_mg/dl:51.0444;ldl_particles_in_mg/dl:128.3844;triglyceride_in_mg/dl:158.5403 | 7 | 0.2% |
| autoantibody_titer_measurement_(procedure):iaa;ica | 7 | 0.2% |
| diastolic_blood_pressure_in_mm/hg:60;systolic_blood_pressure_in_mm/hg:100 | 5 | 0.2% |
| Other values (2795) | 2913 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
| Value | Count | Frequency (%) |
| (unknown) | 604312 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
| Value | Count | Frequency (%) |
| _ | 55291 | 9.1% |
| i | 48146 | 8.0% |
| e | 38229 | 6.3% |
| n | 33837 | 5.6% |
| o | 31170 | 5.2% |
| l | 28005 | 4.6% |
| m | 24744 | 4.1% |
| t | 23738 | 3.9% |
| r | 22611 | 3.7% |
| g | 18271 | 3.0% |
| Other values (52) | 280270 |
body_site
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 24 | 24 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
| feces | |
|---|---|
| feces;rectum | 923 |
| skin epidermis | 373 |
| oral cavity | 220 |
| oral cavity;dorsum of tongue | 198 |
| Other values (19) | 767 |
| feces | |
|---|---|
| feces;rectum | 923 |
| skin epidermis | 373 |
| oral cavity | 220 |
| oral cavity;dorsum of tongue | 198 |
| Other values (19) | 767 |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 39 | 39 |
| Median length | 5 | 5 |
| Mean length | 6.5951282 | 6.545201 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 1 | 1 ? |
| Unique (%) | < 0.1% | < 0.1% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | feces | feces |
| 2nd row | feces | feces |
| 3rd row | feces | feces |
| 4th row | feces | feces |
| 5th row | feces | feces |
Common Values
| Value | Count | Frequency (%) |
| feces | 19400 | |
| feces;rectum | 923 | 4.2% |
| skin epidermis | 373 | 1.7% |
| oral cavity | 220 | 1.0% |
| oral cavity;dorsum of tongue | 198 | 0.9% |
| oral cavity;subgingival dental plaque | 168 | 0.8% |
| oral cavity;supragingival dental plaque | 127 | 0.6% |
| oral cavity;buccal mucosa | 119 | 0.5% |
| nasal cavity;anterior naris | 93 | 0.4% |
| vagina;posterior fornix of vagina | 62 | 0.3% |
| Other values (14) | 198 | 0.9% |
| Value | Count | Frequency (%) |
| feces | 20107 | |
| feces;rectum | 923 | 4.1% |
| skin epidermis | 373 | 1.7% |
| oral cavity | 220 | 1.0% |
| oral cavity;dorsum of tongue | 198 | 0.9% |
| oral cavity;subgingival dental plaque | 168 | 0.7% |
| oral cavity;supragingival dental plaque | 127 | 0.6% |
| oral cavity;buccal mucosa | 119 | 0.5% |
| nasal cavity;anterior naris | 93 | 0.4% |
| vagina;posterior fornix of vagina | 62 | 0.3% |
| Other values (14) | 198 | 0.9% |
Length
Common Values (Plot)
curated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)concatenated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| feces | 19400 | |
| feces;rectum | 923 | 3.7% |
| oral | 857 | 3.5% |
| skin | 504 | 2.0% |
| epidermis | 373 | 1.5% |
| plaque | 295 | 1.2% |
| dental | 295 | 1.2% |
| of | 260 | 1.0% |
| cavity | 220 | 0.9% |
| cavity;dorsum | 198 | 0.8% |
| Other values (31) | 1502 | 6.0% |
| Value | Count | Frequency (%) |
| feces | 20107 | |
| feces;rectum | 923 | 3.6% |
| oral | 857 | 3.4% |
| skin | 504 | 2.0% |
| epidermis | 373 | 1.5% |
| plaque | 295 | 1.2% |
| dental | 295 | 1.2% |
| of | 260 | 1.0% |
| cavity | 220 | 0.9% |
| cavity;dorsum | 198 | 0.8% |
| Other values (31) | 1502 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 43731 | |
| c | 22619 | |
| s | 22217 | |
| f | 20689 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.6% |
| r | 3313 | 2.3% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.3% |
| Value | Count | Frequency (%) |
| e | 45145 | |
| c | 23326 | |
| s | 22924 | |
| f | 21396 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.5% |
| r | 3313 | 2.2% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 144308 |
| Value | Count | Frequency (%) |
| (unknown) | 147843 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 43731 | |
| c | 22619 | |
| s | 22217 | |
| f | 20689 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.6% |
| r | 3313 | 2.3% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.3% |
| Value | Count | Frequency (%) |
| e | 45145 | |
| c | 23326 | |
| s | 22924 | |
| f | 21396 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.5% |
| r | 3313 | 2.2% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 144308 |
| Value | Count | Frequency (%) |
| (unknown) | 147843 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 43731 | |
| c | 22619 | |
| s | 22217 | |
| f | 20689 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.6% |
| r | 3313 | 2.3% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.3% |
| Value | Count | Frequency (%) |
| e | 45145 | |
| c | 23326 | |
| s | 22924 | |
| f | 21396 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.5% |
| r | 3313 | 2.2% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 144308 |
| Value | Count | Frequency (%) |
| (unknown) | 147843 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 43731 | |
| c | 22619 | |
| s | 22217 | |
| f | 20689 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.6% |
| r | 3313 | 2.3% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.3% |
| Value | Count | Frequency (%) |
| e | 45145 | |
| c | 23326 | |
| s | 22924 | |
| f | 21396 | |
| a | 3939 | 2.7% |
| i | 3707 | 2.5% |
| r | 3313 | 2.2% |
| 2946 | 2.0% | |
| t | 2638 | 1.8% |
| u | 2201 | 1.5% |
| Other values (16) | 16308 | 11.0% |
body_site_ontology_term_id
Categorical
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.1 KiB |
| UBERON:0001988 | |
|---|---|
| UBERON:0001988;UBERON:0001052 | 923 |
| UBERON:0001003 | 373 |
| UBERON:0000167 | 220 |
| UBERON:0000167;UBERON:0009471 | 198 |
| Other values (19) | 767 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 15.281934 |
| Min length | 14 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | UBERON:0001988 |
|---|---|
| 2nd row | UBERON:0001988 |
| 3rd row | UBERON:0001988 |
| 4th row | UBERON:0001988 |
| 5th row | UBERON:0001988 |
Common Values
| Value | Count | Frequency (%) |
| UBERON:0001988 | 19400 | |
| UBERON:0001988;UBERON:0001052 | 923 | 4.2% |
| UBERON:0001003 | 373 | 1.7% |
| UBERON:0000167 | 220 | 1.0% |
| UBERON:0000167;UBERON:0009471 | 198 | 0.9% |
| UBERON:0000167;UBERON:0016484 | 168 | 0.8% |
| UBERON:0000167;UBERON:0016485 | 127 | 0.6% |
| UBERON:0000167;UBERON:0006956 | 119 | 0.5% |
| UBERON:0001707;UBERON:2001427 | 93 | 0.4% |
| UBERON:0000996;UBERON:0016486 | 62 | 0.3% |
| Other values (14) | 198 | 0.9% |
Length
| Value | Count | Frequency (%) |
| uberon:0001988 | 19400 | |
| uberon:0001988;uberon:0001052 | 923 | 4.2% |
| uberon:0001003 | 373 | 1.7% |
| uberon:0000167 | 220 | 1.0% |
| uberon:0000167;uberon:0009471 | 198 | 0.9% |
| uberon:0000167;uberon:0016484 | 168 | 0.8% |
| uberon:0000167;uberon:0016485 | 127 | 0.6% |
| uberon:0000167;uberon:0006956 | 119 | 0.5% |
| uberon:0001707;uberon:2001427 | 93 | 0.4% |
| uberon:0000996;uberon:0016486 | 62 | 0.3% |
| Other values (14) | 198 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 73688 | |
| 8 | 41074 | |
| U | 23751 | 7.1% |
| E | 23751 | 7.1% |
| R | 23751 | 7.1% |
| O | 23751 | 7.1% |
| N | 23751 | 7.1% |
| : | 23751 | 7.1% |
| B | 23751 | 7.1% |
| 1 | 23552 | 7.0% |
| Other values (8) | 29813 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 334384 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 73688 | |
| 8 | 41074 | |
| U | 23751 | 7.1% |
| E | 23751 | 7.1% |
| R | 23751 | 7.1% |
| O | 23751 | 7.1% |
| N | 23751 | 7.1% |
| : | 23751 | 7.1% |
| B | 23751 | 7.1% |
| 1 | 23552 | 7.0% |
| Other values (8) | 29813 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 334384 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 73688 | |
| 8 | 41074 | |
| U | 23751 | 7.1% |
| E | 23751 | 7.1% |
| R | 23751 | 7.1% |
| O | 23751 | 7.1% |
| N | 23751 | 7.1% |
| : | 23751 | 7.1% |
| B | 23751 | 7.1% |
| 1 | 23552 | 7.0% |
| Other values (8) | 29813 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 334384 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 73688 | |
| 8 | 41074 | |
| U | 23751 | 7.1% |
| E | 23751 | 7.1% |
| R | 23751 | 7.1% |
| O | 23751 | 7.1% |
| N | 23751 | 7.1% |
| : | 23751 | 7.1% |
| B | 23751 | 7.1% |
| 1 | 23552 | 7.0% |
| Other values (8) | 29813 |
country
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 42 | 42 |
| Distinct (%) | 0.2% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
| United States | |
|---|---|
| United Kingdom | |
| Netherlands | |
| China | |
| Denmark | |
| Other values (37) |
| United States | |
|---|---|
| United Kingdom | |
| Netherlands | |
| China | |
| Denmark | |
| Other values (37) |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 28 | 28 |
| Median length | 17 | 17 |
| Mean length | 9.6552717 | 9.6121835 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 3 | 3 ? |
| Unique (%) | < 0.1% | < 0.1% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Italy | Italy |
| 2nd row | Italy | Italy |
| 3rd row | Italy | Italy |
| 4th row | Italy | Italy |
| 5th row | Italy | Italy |
Common Values
| Value | Count | Frequency (%) |
| United States | 5350 | |
| United Kingdom | 3087 | |
| Netherlands | 1736 | 7.9% |
| China | 1673 | 7.6% |
| Denmark | 1301 | 5.9% |
| Germany | 988 | 4.5% |
| France | 915 | 4.2% |
| Israel | 900 | 4.1% |
| Italy | 853 | 3.9% |
| Japan | 696 | 3.2% |
| Other values (32) | 4382 |
| Value | Count | Frequency (%) |
| United States | 5404 | |
| United Kingdom | 3087 | |
| Netherlands | 2091 | 9.3% |
| China | 1673 | 7.4% |
| Denmark | 1301 | 5.8% |
| Germany | 988 | 4.4% |
| France | 915 | 4.1% |
| Israel | 900 | 4.0% |
| Italy | 853 | 3.8% |
| Japan | 696 | 3.1% |
| Other values (32) | 4680 |
Length
Common Values (Plot)
curated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)concatenated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| united | 8572 | |
| states | 5350 | |
| kingdom | 3087 | 9.9% |
| netherlands | 1736 | 5.6% |
| china | 1673 | 5.4% |
| denmark | 1301 | 4.2% |
| germany | 988 | 3.2% |
| france | 915 | 2.9% |
| israel | 900 | 2.9% |
| italy | 853 | 2.7% |
| Other values (38) | 5841 |
| Value | Count | Frequency (%) |
| united | 8626 | |
| states | 5404 | |
| kingdom | 3087 | 9.7% |
| netherlands | 2091 | 6.5% |
| china | 1673 | 5.2% |
| denmark | 1301 | 4.1% |
| germany | 988 | 3.1% |
| france | 915 | 2.9% |
| israel | 900 | 2.8% |
| italy | 853 | 2.7% |
| Other values (38) | 6139 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24265 | |
| n | 23170 | |
| t | 22763 | |
| a | 20438 | 9.7% |
| i | 16568 | 7.8% |
| d | 15623 | 7.4% |
| s | 9386 | 4.4% |
| 9335 | 4.4% | |
| U | 8572 | 4.1% |
| r | 7195 | 3.4% |
| Other values (36) | 53952 |
| Value | Count | Frequency (%) |
| e | 25083 | |
| n | 23606 | |
| t | 23280 | |
| a | 20928 | 9.6% |
| i | 17164 | 7.9% |
| d | 16059 | 7.4% |
| s | 9795 | 4.5% |
| 9389 | 4.3% | |
| U | 8626 | 4.0% |
| r | 7550 | 3.5% |
| Other values (36) | 55640 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 211267 |
| Value | Count | Frequency (%) |
| (unknown) | 217120 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 24265 | |
| n | 23170 | |
| t | 22763 | |
| a | 20438 | 9.7% |
| i | 16568 | 7.8% |
| d | 15623 | 7.4% |
| s | 9386 | 4.4% |
| 9335 | 4.4% | |
| U | 8572 | 4.1% |
| r | 7195 | 3.4% |
| Other values (36) | 53952 |
| Value | Count | Frequency (%) |
| e | 25083 | |
| n | 23606 | |
| t | 23280 | |
| a | 20928 | 9.6% |
| i | 17164 | 7.9% |
| d | 16059 | 7.4% |
| s | 9795 | 4.5% |
| 9389 | 4.3% | |
| U | 8626 | 4.0% |
| r | 7550 | 3.5% |
| Other values (36) | 55640 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 211267 |
| Value | Count | Frequency (%) |
| (unknown) | 217120 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 24265 | |
| n | 23170 | |
| t | 22763 | |
| a | 20438 | 9.7% |
| i | 16568 | 7.8% |
| d | 15623 | 7.4% |
| s | 9386 | 4.4% |
| 9335 | 4.4% | |
| U | 8572 | 4.1% |
| r | 7195 | 3.4% |
| Other values (36) | 53952 |
| Value | Count | Frequency (%) |
| e | 25083 | |
| n | 23606 | |
| t | 23280 | |
| a | 20928 | 9.6% |
| i | 17164 | 7.9% |
| d | 16059 | 7.4% |
| s | 9795 | 4.5% |
| 9389 | 4.3% | |
| U | 8626 | 4.0% |
| r | 7550 | 3.5% |
| Other values (36) | 55640 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 211267 |
| Value | Count | Frequency (%) |
| (unknown) | 217120 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 24265 | |
| n | 23170 | |
| t | 22763 | |
| a | 20438 | 9.7% |
| i | 16568 | 7.8% |
| d | 15623 | 7.4% |
| s | 9386 | 4.4% |
| 9335 | 4.4% | |
| U | 8572 | 4.1% |
| r | 7195 | 3.4% |
| Other values (36) | 53952 |
| Value | Count | Frequency (%) |
| e | 25083 | |
| n | 23606 | |
| t | 23280 | |
| a | 20928 | 9.6% |
| i | 17164 | 7.9% |
| d | 16059 | 7.4% |
| s | 9795 | 4.5% |
| 9389 | 4.3% | |
| U | 8626 | 4.0% |
| r | 7550 | 3.5% |
| Other values (36) | 55640 |
country_ontology_term_id
Categorical
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.1 KiB |
| NCIT:C17234 | |
|---|---|
| NCIT:C17233 | |
| NCIT:C16903 | |
| NCIT:C16428 | |
| NCIT:C16496 | |
| Other values (37) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NCIT:C16761 |
|---|---|
| 2nd row | NCIT:C16761 |
| 3rd row | NCIT:C16761 |
| 4th row | NCIT:C16761 |
| 5th row | NCIT:C16761 |
Common Values
| Value | Count | Frequency (%) |
| NCIT:C17234 | 5350 | |
| NCIT:C17233 | 3087 | |
| NCIT:C16903 | 1736 | 7.9% |
| NCIT:C16428 | 1673 | 7.6% |
| NCIT:C16496 | 1301 | 5.9% |
| NCIT:C16636 | 988 | 4.5% |
| NCIT:C16592 | 915 | 4.2% |
| NCIT:C16760 | 900 | 4.1% |
| NCIT:C16761 | 853 | 3.9% |
| NCIT:C16764 | 696 | 3.2% |
| Other values (32) | 4382 |
Length
| Value | Count | Frequency (%) |
| ncit:c17234 | 5350 | |
| ncit:c17233 | 3087 | |
| ncit:c16903 | 1736 | 7.9% |
| ncit:c16428 | 1673 | 7.6% |
| ncit:c16496 | 1301 | 5.9% |
| ncit:c16636 | 988 | 4.5% |
| ncit:c16592 | 915 | 4.2% |
| ncit:c16760 | 900 | 4.1% |
| ncit:c16761 | 853 | 3.9% |
| ncit:c16764 | 696 | 3.2% |
| Other values (32) | 4382 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 43762 | |
| 1 | 24923 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 6 | 17818 | |
| 3 | 15499 | 6.4% |
| 7 | 13733 | 5.7% |
| 2 | 12824 | 5.3% |
| Other values (5) | 24608 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 240691 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| 1 | 24923 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 6 | 17818 | |
| 3 | 15499 | 6.4% |
| 7 | 13733 | 5.7% |
| 2 | 12824 | 5.3% |
| Other values (5) | 24608 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 240691 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| 1 | 24923 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 6 | 17818 | |
| 3 | 15499 | 6.4% |
| 7 | 13733 | 5.7% |
| 2 | 12824 | 5.3% |
| Other values (5) | 24608 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 240691 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| 1 | 24923 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 6 | 17818 | |
| 3 | 15499 | 6.4% |
| 7 | 13733 | 5.7% |
| 2 | 12824 | 5.3% |
| Other values (5) | 24608 |
dietary_restriction
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | 0.7% | 0.7% |
| Missing | 21464 | 22171 |
| Missing (%) | 98.1% | 98.2% |
| Memory size | 171.1 KiB | 176.6 KiB |
| omnivore | |
|---|---|
| vegetarian | |
| vegan |
| omnivore | |
|---|---|
| vegetarian | |
| vegan |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 8 | 8 |
| Mean length | 7.9760192 | 7.9760192 |
| Min length | 5 | 5 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | omnivore | omnivore |
| 2nd row | omnivore | omnivore |
| 3rd row | omnivore | omnivore |
| 4th row | omnivore | omnivore |
| 5th row | omnivore | omnivore |
Common Values
| Value | Count | Frequency (%) |
| omnivore | 332 | 1.5% |
| vegetarian | 49 | 0.2% |
| vegan | 36 | 0.2% |
| (Missing) | 21464 |
| Value | Count | Frequency (%) |
| omnivore | 332 | 1.5% |
| vegetarian | 49 | 0.2% |
| vegan | 36 | 0.2% |
| (Missing) | 22171 |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| omnivore | 332 | |
| vegetarian | 49 | 11.8% |
| vegan | 36 | 8.6% |
| Value | Count | Frequency (%) |
| omnivore | 332 | |
| vegetarian | 49 | 11.8% |
| vegan | 36 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
| Value | Count | Frequency (%) |
| (unknown) | 3326 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
| Value | Count | Frequency (%) |
| o | 664 | |
| e | 466 | |
| n | 417 | |
| v | 417 | |
| i | 381 | |
| r | 381 | |
| m | 332 | |
| a | 134 | 4.0% |
| g | 85 | 2.6% |
| t | 49 | 1.5% |
feces_phenotype_metric
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | 0.3% | 0.3% |
| Missing | 20784 | 21491 |
| Missing (%) | 95.0% | 95.1% |
| Memory size | 171.1 KiB | 176.6 KiB |
| Bristol stool form score (observable entity) | |
|---|---|
| Calprotectin Measurement | |
| Calprotectin Measurement;Harvey-Bradshaw Index Clinical Classification | 80 |
| Bristol stool form score (observable entity) | |
|---|---|
| Calprotectin Measurement | |
| Calprotectin Measurement;Harvey-Bradshaw Index Clinical Classification | 80 |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 70 | 70 |
| Median length | 44 | 44 |
| Mean length | 42.559708 | 42.559708 |
| Min length | 24 | 24 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Calprotectin Measurement | Calprotectin Measurement |
| 2nd row | Calprotectin Measurement | Calprotectin Measurement |
| 3rd row | Calprotectin Measurement | Calprotectin Measurement |
| 4th row | Calprotectin Measurement | Calprotectin Measurement |
| 5th row | Calprotectin Measurement | Calprotectin Measurement |
Common Values
| Value | Count | Frequency (%) |
| Bristol stool form score (observable entity) | 834 | 3.8% |
| Calprotectin Measurement | 183 | 0.8% |
| Calprotectin Measurement;Harvey-Bradshaw Index Clinical Classification | 80 | 0.4% |
| (Missing) | 20784 |
| Value | Count | Frequency (%) |
| Bristol stool form score (observable entity) | 834 | 3.7% |
| Calprotectin Measurement | 183 | 0.8% |
| Calprotectin Measurement;Harvey-Bradshaw Index Clinical Classification | 80 | 0.4% |
| (Missing) | 21491 |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| bristol | 834 | |
| stool | 834 | |
| form | 834 | |
| score | 834 | |
| observable | 834 | |
| entity | 834 | |
| calprotectin | 263 | 4.6% |
| measurement | 183 | 3.2% |
| measurement;harvey-bradshaw | 80 | 1.4% |
| index | 80 | 1.4% |
| Other values (2) | 160 | 2.8% |
| Value | Count | Frequency (%) |
| bristol | 834 | |
| stool | 834 | |
| form | 834 | |
| score | 834 | |
| observable | 834 | |
| entity | 834 | |
| calprotectin | 263 | 4.6% |
| measurement | 183 | 3.2% |
| measurement;harvey-bradshaw | 80 | 1.4% |
| index | 80 | 1.4% |
| Other values (2) | 160 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
| Value | Count | Frequency (%) |
| (unknown) | 46688 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
| Value | Count | Frequency (%) |
| o | 5347 | |
| 4673 | ||
| e | 4548 | |
| t | 4205 | 9.0% |
| r | 4022 | 8.6% |
| s | 3839 | 8.2% |
| l | 3005 | 6.4% |
| i | 2331 | 5.0% |
| a | 1840 | 3.9% |
| b | 1668 | 3.6% |
| Other values (21) | 11210 |
feces_phenotype_value
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 257 | 257 |
| Distinct (%) | 23.4% | 23.4% |
| Missing | 20784 | 21491 |
| Missing (%) | 95.0% | 95.1% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 1 | 1 |
| Mean length | 2.2415679 | 2.2415679 |
| Min length | 1 | 1 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 245 | 245 ? |
| Unique (%) | 22.3% | 22.3% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | 234.1497 | 234.1497 |
| 2nd row | 6.1613 | 6.1613 |
| 3rd row | 35.26 | 35.26 |
| 4th row | 3.7909 | 3.7909 |
| 5th row | 5.0293 | 5.0293 |
| Value | Count | Frequency (%) |
| 4 | 427 | |
| 3 | 200 | |
| 2 | 67 | 6.1% |
| 6 | 67 | 6.1% |
| 5 | 46 | 4.2% |
| 1 | 21 | 1.9% |
| 0 | 10 | 0.9% |
| 7 | 6 | 0.5% |
| 70 | 2 | 0.2% |
| 188.628 | 2 | 0.2% |
| Other values (247) | 249 |
| Value | Count | Frequency (%) |
| 4 | 427 | |
| 3 | 200 | |
| 2 | 67 | 6.1% |
| 6 | 67 | 6.1% |
| 5 | 46 | 4.2% |
| 1 | 21 | 1.9% |
| 0 | 10 | 0.9% |
| 7 | 6 | 0.5% |
| 70 | 2 | 0.2% |
| 188.628 | 2 | 0.2% |
| Other values (247) | 249 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
| Value | Count | Frequency (%) |
| (unknown) | 2459 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
| Value | Count | Frequency (%) |
| 4 | 551 | |
| 3 | 339 | |
| 2 | 279 | |
| 1 | 227 | |
| 6 | 196 | 8.0% |
| 5 | 156 | 6.3% |
| . | 149 | 6.1% |
| 8 | 134 | 5.4% |
| 0 | 125 | 5.1% |
| 7 | 122 | 5.0% |
| Other values (2) | 181 | 7.4% |
feces_phenotype_metric_ontology_term_id
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 20784 |
| Missing (%) | 95.0% |
| Memory size | 171.1 KiB |
| SNOMED:443172007 | |
|---|---|
| NCIT:C82005 | |
| NCIT:C82005;NCIT:C191036 | 80 |
Length
| Max length | 24 |
|---|---|
| Median length | 16 |
| Mean length | 15.749316 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NCIT:C82005 |
|---|---|
| 2nd row | NCIT:C82005 |
| 3rd row | NCIT:C82005 |
| 4th row | NCIT:C82005 |
| 5th row | NCIT:C82005 |
Common Values
| Value | Count | Frequency (%) |
| SNOMED:443172007 | 834 | 3.8% |
| NCIT:C82005 | 183 | 0.8% |
| NCIT:C82005;NCIT:C191036 | 80 | 0.4% |
| (Missing) | 20784 |
Length
| Value | Count | Frequency (%) |
| snomed:443172007 | 834 | |
| ncit:c82005 | 183 | 16.7% |
| ncit:c82005;ncit:c191036 | 80 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2274 | |
| 7 | 1668 | 9.7% |
| 4 | 1668 | 9.7% |
| N | 1177 | 6.8% |
| : | 1177 | 6.8% |
| 2 | 1097 | 6.3% |
| 1 | 994 | 5.8% |
| 3 | 914 | 5.3% |
| S | 834 | 4.8% |
| D | 834 | 4.8% |
| Other values (11) | 4640 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17277 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2274 | |
| 7 | 1668 | 9.7% |
| 4 | 1668 | 9.7% |
| N | 1177 | 6.8% |
| : | 1177 | 6.8% |
| 2 | 1097 | 6.3% |
| 1 | 994 | 5.8% |
| 3 | 914 | 5.3% |
| S | 834 | 4.8% |
| D | 834 | 4.8% |
| Other values (11) | 4640 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17277 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2274 | |
| 7 | 1668 | 9.7% |
| 4 | 1668 | 9.7% |
| N | 1177 | 6.8% |
| : | 1177 | 6.8% |
| 2 | 1097 | 6.3% |
| 1 | 994 | 5.8% |
| 3 | 914 | 5.3% |
| S | 834 | 4.8% |
| D | 834 | 4.8% |
| Other values (11) | 4640 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17277 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2274 | |
| 7 | 1668 | 9.7% |
| 4 | 1668 | 9.7% |
| N | 1177 | 6.8% |
| : | 1177 | 6.8% |
| 2 | 1097 | 6.3% |
| 1 | 994 | 5.8% |
| 3 | 914 | 5.3% |
| S | 834 | 4.8% |
| D | 834 | 4.8% |
| Other values (11) | 4640 |
fmt_role
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 21725 |
| Missing (%) | 99.3% |
| Memory size | 171.1 KiB |
| Recipient (after procedure) | |
|---|---|
| Recipient (before procedure) | |
| Donor |
Length
| Max length | 28 |
|---|---|
| Median length | 27 |
| Mean length | 25.532051 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Recipient (before procedure) |
|---|---|
| 2nd row | Recipient (before procedure) |
| 3rd row | Recipient (before procedure) |
| 4th row | Recipient (before procedure) |
| 5th row | Recipient (before procedure) |
Common Values
| Value | Count | Frequency (%) |
| Recipient (after procedure) | 109 | 0.5% |
| Recipient (before procedure) | 35 | 0.2% |
| Donor | 12 | 0.1% |
| (Missing) | 21725 |
Length
| Value | Count | Frequency (%) |
| recipient | 144 | |
| procedure | 144 | |
| after | 109 | |
| before | 35 | 7.9% |
| donor | 12 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 755 | |
| r | 444 | |
| c | 288 | 7.2% |
| i | 288 | 7.2% |
| p | 288 | 7.2% |
| 288 | 7.2% | |
| t | 253 | 6.4% |
| o | 203 | 5.1% |
| n | 156 | 3.9% |
| R | 144 | 3.6% |
| Other values (8) | 876 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3983 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 755 | |
| r | 444 | |
| c | 288 | 7.2% |
| i | 288 | 7.2% |
| p | 288 | 7.2% |
| 288 | 7.2% | |
| t | 253 | 6.4% |
| o | 203 | 5.1% |
| n | 156 | 3.9% |
| R | 144 | 3.6% |
| Other values (8) | 876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3983 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 755 | |
| r | 444 | |
| c | 288 | 7.2% |
| i | 288 | 7.2% |
| p | 288 | 7.2% |
| 288 | 7.2% | |
| t | 253 | 6.4% |
| o | 203 | 5.1% |
| n | 156 | 3.9% |
| R | 144 | 3.6% |
| Other values (8) | 876 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3983 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 755 | |
| r | 444 | |
| c | 288 | 7.2% |
| i | 288 | 7.2% |
| p | 288 | 7.2% |
| 288 | 7.2% | |
| t | 253 | 6.4% |
| o | 203 | 5.1% |
| n | 156 | 3.9% |
| R | 144 | 3.6% |
| Other values (8) | 876 |
fmt_id
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 45 | 45 |
| Distinct (%) | 31.0% | 31.0% |
| Missing | 21736 | 22443 |
| Missing (%) | 99.3% | 99.4% |
| Memory size | 171.1 KiB | 176.6 KiB |
| IaniroG_2020_2022_287 | 5 |
|---|---|
| IaniroG_2020_2022_281 | 5 |
| IaniroG_2020_2022_288 | 5 |
| IaniroG_2020_2022_286 | 5 |
| IaniroG_2020_2022_277 | 5 |
| Other values (40) |
| IaniroG_2020_2022_287 | 5 |
|---|---|
| IaniroG_2020_2022_281 | 5 |
| IaniroG_2020_2022_288 | 5 |
| IaniroG_2020_2022_286 | 5 |
| IaniroG_2020_2022_277 | 5 |
| Other values (40) |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 285 | 285 |
| Median length | 21 | 21 |
| Mean length | 24.793103 | 24.793103 |
| Min length | 21 | 21 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 8 | 8 ? |
| Unique (%) | 5.5% | 5.5% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | IaniroG_2020_2022_260 | IaniroG_2020_2022_260 |
| 2nd row | IaniroG_2020_2022_264 | IaniroG_2020_2022_264 |
| 3rd row | IaniroG_2020_2022_267 | IaniroG_2020_2022_267 |
| 4th row | IaniroG_2020_2022_268 | IaniroG_2020_2022_268 |
| 5th row | IaniroG_2020_2022_262 | IaniroG_2020_2022_262 |
Common Values
| Value | Count | Frequency (%) |
| IaniroG_2020_2022_287 | 5 | < 0.1% |
| IaniroG_2020_2022_281 | 5 | < 0.1% |
| IaniroG_2020_2022_288 | 5 | < 0.1% |
| IaniroG_2020_2022_286 | 5 | < 0.1% |
| IaniroG_2020_2022_277 | 5 | < 0.1% |
| IaniroG_2020_2022_273 | 5 | < 0.1% |
| IaniroG_2020_2022_284 | 5 | < 0.1% |
| IaniroG_2020_2022_278 | 5 | < 0.1% |
| IaniroG_2020_2022_274 | 5 | < 0.1% |
| IaniroG_2020_2022_285 | 5 | < 0.1% |
| Other values (35) | 95 | 0.4% |
| (Missing) | 21736 |
| Value | Count | Frequency (%) |
| IaniroG_2020_2022_287 | 5 | < 0.1% |
| IaniroG_2020_2022_281 | 5 | < 0.1% |
| IaniroG_2020_2022_288 | 5 | < 0.1% |
| IaniroG_2020_2022_286 | 5 | < 0.1% |
| IaniroG_2020_2022_277 | 5 | < 0.1% |
| IaniroG_2020_2022_273 | 5 | < 0.1% |
| IaniroG_2020_2022_284 | 5 | < 0.1% |
| IaniroG_2020_2022_278 | 5 | < 0.1% |
| IaniroG_2020_2022_274 | 5 | < 0.1% |
| IaniroG_2020_2022_285 | 5 | < 0.1% |
| Other values (35) | 95 | 0.4% |
| (Missing) | 22443 |
Length
Common Values (Plot)
curated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)concatenated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| ianirog_2020_2022_287 | 5 | 3.4% |
| ianirog_2020_2022_288 | 5 | 3.4% |
| ianirog_2020_2022_286 | 5 | 3.4% |
| ianirog_2020_2022_277 | 5 | 3.4% |
| ianirog_2020_2022_273 | 5 | 3.4% |
| ianirog_2020_2022_284 | 5 | 3.4% |
| ianirog_2020_2022_278 | 5 | 3.4% |
| ianirog_2020_2022_274 | 5 | 3.4% |
| ianirog_2020_2022_285 | 5 | 3.4% |
| ianirog_2020_2022_271 | 5 | 3.4% |
| Other values (35) | 95 |
| Value | Count | Frequency (%) |
| ianirog_2020_2022_287 | 5 | 3.4% |
| ianirog_2020_2022_288 | 5 | 3.4% |
| ianirog_2020_2022_286 | 5 | 3.4% |
| ianirog_2020_2022_277 | 5 | 3.4% |
| ianirog_2020_2022_273 | 5 | 3.4% |
| ianirog_2020_2022_284 | 5 | 3.4% |
| ianirog_2020_2022_278 | 5 | 3.4% |
| ianirog_2020_2022_274 | 5 | 3.4% |
| ianirog_2020_2022_285 | 5 | 3.4% |
| ianirog_2020_2022_271 | 5 | 3.4% |
| Other values (35) | 95 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
| Value | Count | Frequency (%) |
| (unknown) | 3595 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
| Value | Count | Frequency (%) |
| 2 | 1037 | |
| 0 | 526 | |
| _ | 510 | |
| G | 170 | 4.7% |
| a | 170 | 4.7% |
| I | 170 | 4.7% |
| o | 170 | 4.7% |
| r | 170 | 4.7% |
| i | 170 | 4.7% |
| n | 170 | 4.7% |
| Other values (9) | 332 | 9.2% |
sex
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 2558 | 2558 |
| Missing (%) | 11.7% | 11.3% |
| Memory size | 171.1 KiB | 176.6 KiB |
| Female | |
|---|---|
| Male |
| Female | |
|---|---|
| Male |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 5.0032604 | 5.0141787 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Female | Female |
| 2nd row | Male | Male |
| 3rd row | Male | Male |
| 4th row | Male | Male |
| 5th row | Male | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 9693 | |
| Male | 9630 | |
| (Missing) | 2558 | 11.7% |
| Value | Count | Frequency (%) |
| Female | 10157 | |
| Male | 9873 | |
| (Missing) | 2558 | 11.3% |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| female | 9693 | |
| male | 9630 |
| Value | Count | Frequency (%) |
| female | 10157 | |
| male | 9873 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 29016 | |
| a | 19323 | |
| l | 19323 | |
| F | 9693 | 10.0% |
| m | 9693 | 10.0% |
| M | 9630 | 10.0% |
| Value | Count | Frequency (%) |
| e | 30187 | |
| a | 20030 | |
| l | 20030 | |
| F | 10157 | 10.1% |
| m | 10157 | 10.1% |
| M | 9873 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 96678 |
| Value | Count | Frequency (%) |
| (unknown) | 100434 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 29016 | |
| a | 19323 | |
| l | 19323 | |
| F | 9693 | 10.0% |
| m | 9693 | 10.0% |
| M | 9630 | 10.0% |
| Value | Count | Frequency (%) |
| e | 30187 | |
| a | 20030 | |
| l | 20030 | |
| F | 10157 | 10.1% |
| m | 10157 | 10.1% |
| M | 9873 | 9.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 96678 |
| Value | Count | Frequency (%) |
| (unknown) | 100434 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 29016 | |
| a | 19323 | |
| l | 19323 | |
| F | 9693 | 10.0% |
| m | 9693 | 10.0% |
| M | 9630 | 10.0% |
| Value | Count | Frequency (%) |
| e | 30187 | |
| a | 20030 | |
| l | 20030 | |
| F | 10157 | 10.1% |
| m | 10157 | 10.1% |
| M | 9873 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 96678 |
| Value | Count | Frequency (%) |
| (unknown) | 100434 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 29016 | |
| a | 19323 | |
| l | 19323 | |
| F | 9693 | 10.0% |
| m | 9693 | 10.0% |
| M | 9630 | 10.0% |
| Value | Count | Frequency (%) |
| e | 30187 | |
| a | 20030 | |
| l | 20030 | |
| F | 10157 | 10.1% |
| m | 10157 | 10.1% |
| M | 9873 | 9.8% |
sex_ontology_term_id
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2558 |
| Missing (%) | 11.7% |
| Memory size | 171.1 KiB |
| NCIT:C16576 | |
|---|---|
| NCIT:C20197 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NCIT:C16576 |
|---|---|
| 2nd row | NCIT:C20197 |
| 3rd row | NCIT:C20197 |
| 4th row | NCIT:C20197 |
| 5th row | NCIT:C20197 |
Common Values
| Value | Count | Frequency (%) |
| NCIT:C16576 | 9693 | |
| NCIT:C20197 | 9630 | |
| (Missing) | 2558 | 11.7% |
Length
| Value | Count | Frequency (%) |
| ncit:c16576 | 9693 | |
| ncit:c20197 | 9630 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 38646 | |
| 6 | 19386 | |
| N | 19323 | |
| I | 19323 | |
| T | 19323 | |
| : | 19323 | |
| 1 | 19323 | |
| 7 | 19323 | |
| 5 | 9693 | 4.6% |
| 2 | 9630 | 4.5% |
| Other values (2) | 19260 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 212553 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 38646 | |
| 6 | 19386 | |
| N | 19323 | |
| I | 19323 | |
| T | 19323 | |
| : | 19323 | |
| 1 | 19323 | |
| 7 | 19323 | |
| 5 | 9693 | 4.6% |
| 2 | 9630 | 4.5% |
| Other values (2) | 19260 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 212553 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 38646 | |
| 6 | 19386 | |
| N | 19323 | |
| I | 19323 | |
| T | 19323 | |
| : | 19323 | |
| 1 | 19323 | |
| 7 | 19323 | |
| 5 | 9693 | 4.6% |
| 2 | 9630 | 4.5% |
| Other values (2) | 19260 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 212553 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 38646 | |
| 6 | 19386 | |
| N | 19323 | |
| I | 19323 | |
| T | 19323 | |
| : | 19323 | |
| 1 | 19323 | |
| 7 | 19323 | |
| 5 | 9693 | 4.6% |
| 2 | 9630 | 4.5% |
| Other values (2) | 19260 |
hla
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 20981 |
| Missing (%) | 95.9% |
| Memory size | 171.1 KiB |
| HLA protein complex with DQ5 serotype | |
|---|---|
| HLA-DRB1*04:01 protein complex | |
| HLA-DQA1*02:01 protein complex;HLA protein complex with DQ5 serotype | |
| HLA-DRB1*04:04 protein complex | |
| HLA protein complex with DQ5 serotype;HLA protein complex with DQ5 serotype | |
| Other values (30) |
Length
| Max length | 175 |
|---|---|
| Median length | 144 |
| Mean length | 64.896667 |
| Min length | 30 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | HLA-DQB1*03:02 protein complex;HLA-DQB1*05:01 protein complex;HLA-DRB1*04:04 protein complex |
|---|---|
| 2nd row | HLA-DQB1*03:02 protein complex;HLA-DQB1*05:01 protein complex;HLA-DRB1*04:04 protein complex |
| 3rd row | HLA-DQB1*03:02 protein complex;HLA-DQB1*05:01 protein complex;HLA-DRB1*04:04 protein complex |
| 4th row | HLA-DQB1*03:02 protein complex;HLA-DQB1*05:01 protein complex;HLA-DRB1*04:04 protein complex |
| 5th row | HLA-DQB1*03:02 protein complex;HLA-DQB1*05:01 protein complex;HLA-DRB1*04:04 protein complex |
Common Values
| Value | Count | Frequency (%) |
| HLA protein complex with DQ5 serotype | 225 | 1.0% |
| HLA-DRB1*04:01 protein complex | 102 | 0.5% |
| HLA-DQA1*02:01 protein complex;HLA protein complex with DQ5 serotype | 98 | 0.4% |
| HLA-DRB1*04:04 protein complex | 84 | 0.4% |
| HLA protein complex with DQ5 serotype;HLA protein complex with DQ5 serotype | 49 | 0.2% |
| HLA-DRB1*04:01 protein complex;HLA protein complex with DQ3 serotype;HLA protein complex with DQ5 serotype | 41 | 0.2% |
| HLA-DQB1*03:02 protein complex;HLA protein complex with DQ4 serotype;HLA-DRB1*04:01 protein complex | 34 | 0.2% |
| HLA-DRB1*04:04 protein complex;HLA protein complex with DQ3 serotype;HLA-DQA1*02:01 protein complex | 32 | 0.1% |
| HLA-DRB1*04:04 protein complex;HLA protein complex with DQ3 serotype;HLA protein complex with DQ5 serotype | 31 | 0.1% |
| HLA protein complex with DQ3 serotype;HLA protein complex with DQ5 serotype | 28 | 0.1% |
| Other values (25) | 176 | 0.8% |
| (Missing) | 20981 |
Length
| Value | Count | Frequency (%) |
| protein | 1702 | |
| complex | 1319 | |
| with | 935 | |
| dq5 | 574 | 8.1% |
| serotype | 516 | 7.3% |
| hla | 371 | 5.2% |
| complex;hla | 292 | 4.1% |
| serotype;hla | 272 | 3.8% |
| dq3 | 256 | 3.6% |
| hla-drb1*04:04 | 168 | 2.4% |
| Other values (19) | 704 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6209 | 10.6% | |
| e | 5274 | 9.0% |
| p | 4339 | 7.4% |
| o | 4339 | 7.4% |
| t | 3572 | 6.1% |
| r | 2637 | 4.5% |
| i | 2637 | 4.5% |
| A | 1856 | 3.2% |
| H | 1702 | 2.9% |
| l | 1702 | 2.9% |
| Other values (25) | 24140 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58407 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6209 | 10.6% | |
| e | 5274 | 9.0% |
| p | 4339 | 7.4% |
| o | 4339 | 7.4% |
| t | 3572 | 6.1% |
| r | 2637 | 4.5% |
| i | 2637 | 4.5% |
| A | 1856 | 3.2% |
| H | 1702 | 2.9% |
| l | 1702 | 2.9% |
| Other values (25) | 24140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58407 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6209 | 10.6% | |
| e | 5274 | 9.0% |
| p | 4339 | 7.4% |
| o | 4339 | 7.4% |
| t | 3572 | 6.1% |
| r | 2637 | 4.5% |
| i | 2637 | 4.5% |
| A | 1856 | 3.2% |
| H | 1702 | 2.9% |
| l | 1702 | 2.9% |
| Other values (25) | 24140 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58407 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6209 | 10.6% | |
| e | 5274 | 9.0% |
| p | 4339 | 7.4% |
| o | 4339 | 7.4% |
| t | 3572 | 6.1% |
| r | 2637 | 4.5% |
| i | 2637 | 4.5% |
| A | 1856 | 3.2% |
| H | 1702 | 2.9% |
| l | 1702 | 2.9% |
| Other values (25) | 24140 |
hla_ontology_term_id
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 20981 |
| Missing (%) | 95.9% |
| Memory size | 171.1 KiB |
| MRO:0001626 | |
|---|---|
| MRO:0001290 | |
| MRO:0001211;MRO:0001626 | |
| MRO:0001293 | |
| MRO:0001626;MRO:0001626 | |
| Other values (30) |
Length
| Max length | 59 |
|---|---|
| Median length | 47 |
| Mean length | 21.693333 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | MRO:0001240;MRO:0001247;MRO:0001293 |
|---|---|
| 2nd row | MRO:0001240;MRO:0001247;MRO:0001293 |
| 3rd row | MRO:0001240;MRO:0001247;MRO:0001293 |
| 4th row | MRO:0001240;MRO:0001247;MRO:0001293 |
| 5th row | MRO:0001240;MRO:0001247;MRO:0001293 |
Common Values
| Value | Count | Frequency (%) |
| MRO:0001626 | 225 | 1.0% |
| MRO:0001290 | 102 | 0.5% |
| MRO:0001211;MRO:0001626 | 98 | 0.4% |
| MRO:0001293 | 84 | 0.4% |
| MRO:0001626;MRO:0001626 | 49 | 0.2% |
| MRO:0001290;MRO:0001622;MRO:0001626 | 41 | 0.2% |
| MRO:0001240;MRO:0001625;MRO:0001290 | 34 | 0.2% |
| MRO:0001293;MRO:0001622;MRO:0001211 | 32 | 0.1% |
| MRO:0001293;MRO:0001622;MRO:0001626 | 31 | 0.1% |
| MRO:0001622;MRO:0001626 | 28 | 0.1% |
| Other values (25) | 176 | 0.8% |
| (Missing) | 20981 |
Length
| Value | Count | Frequency (%) |
| mro:0001626 | 225 | |
| mro:0001290 | 102 | |
| mro:0001211;mro:0001626 | 98 | |
| mro:0001293 | 84 | 9.3% |
| mro:0001626;mro:0001626 | 49 | 5.4% |
| mro:0001290;mro:0001622;mro:0001626 | 41 | 4.6% |
| mro:0001240;mro:0001625;mro:0001290 | 34 | 3.8% |
| mro:0001293;mro:0001622;mro:0001211 | 32 | 3.6% |
| mro:0001293;mro:0001622;mro:0001626 | 31 | 3.4% |
| mro:0001622;mro:0001626 | 28 | 3.1% |
| Other values (25) | 176 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5496 | |
| 1 | 2028 | 10.4% |
| 2 | 1948 | 10.0% |
| M | 1702 | 8.7% |
| R | 1702 | 8.7% |
| O | 1702 | 8.7% |
| : | 1702 | 8.7% |
| 6 | 1509 | 7.7% |
| ; | 802 | 4.1% |
| 9 | 494 | 2.5% |
| Other values (4) | 439 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19524 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5496 | |
| 1 | 2028 | 10.4% |
| 2 | 1948 | 10.0% |
| M | 1702 | 8.7% |
| R | 1702 | 8.7% |
| O | 1702 | 8.7% |
| : | 1702 | 8.7% |
| 6 | 1509 | 7.7% |
| ; | 802 | 4.1% |
| 9 | 494 | 2.5% |
| Other values (4) | 439 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19524 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5496 | |
| 1 | 2028 | 10.4% |
| 2 | 1948 | 10.0% |
| M | 1702 | 8.7% |
| R | 1702 | 8.7% |
| O | 1702 | 8.7% |
| : | 1702 | 8.7% |
| 6 | 1509 | 7.7% |
| ; | 802 | 4.1% |
| 9 | 494 | 2.5% |
| Other values (4) | 439 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19524 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5496 | |
| 1 | 2028 | 10.4% |
| 2 | 1948 | 10.0% |
| M | 1702 | 8.7% |
| R | 1702 | 8.7% |
| O | 1702 | 8.7% |
| : | 1702 | 8.7% |
| 6 | 1509 | 7.7% |
| ; | 802 | 4.1% |
| 9 | 494 | 2.5% |
| Other values (4) | 439 | 2.2% |
smoker
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 18901 | 19608 |
| Missing (%) | 86.4% | 86.8% |
| Memory size | 171.1 KiB | 176.6 KiB |
| Non-smoker (finding) | |
|---|---|
| Non-smoker (finding);Never smoked tobacco (finding) | |
| Smoker (finding) | |
| Non-smoker (finding);Ex-smoker (finding) |
| Non-smoker (finding) | |
|---|---|
| Non-smoker (finding);Never smoked tobacco (finding) | |
| Smoker (finding) | |
| Non-smoker (finding);Ex-smoker (finding) |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 51 | 51 |
| Median length | 20 | 20 |
| Mean length | 28.798993 | 28.798993 |
| Min length | 16 | 16 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Non-smoker (finding);Never smoked tobacco (finding) | Non-smoker (finding);Never smoked tobacco (finding) |
| 2nd row | Non-smoker (finding);Never smoked tobacco (finding) | Non-smoker (finding);Never smoked tobacco (finding) |
| 3rd row | Non-smoker (finding);Never smoked tobacco (finding) | Non-smoker (finding);Never smoked tobacco (finding) |
| 4th row | Non-smoker (finding);Never smoked tobacco (finding) | Non-smoker (finding);Never smoked tobacco (finding) |
| 5th row | Non-smoker (finding);Never smoked tobacco (finding) | Non-smoker (finding);Never smoked tobacco (finding) |
Common Values
| Value | Count | Frequency (%) |
| Non-smoker (finding) | 1584 | 7.2% |
| Non-smoker (finding);Never smoked tobacco (finding) | 799 | 3.7% |
| Smoker (finding) | 437 | 2.0% |
| Non-smoker (finding);Ex-smoker (finding) | 160 | 0.7% |
| (Missing) | 18901 |
| Value | Count | Frequency (%) |
| Non-smoker (finding) | 1584 | 7.0% |
| Non-smoker (finding);Never smoked tobacco (finding) | 799 | 3.5% |
| Smoker (finding) | 437 | 1.9% |
| Non-smoker (finding);Ex-smoker (finding) | 160 | 0.7% |
| (Missing) | 19608 |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| finding | 2980 | |
| non-smoker | 2543 | |
| finding);never | 799 | 9.4% |
| smoked | 799 | 9.4% |
| tobacco | 799 | 9.4% |
| smoker | 437 | 5.1% |
| finding);ex-smoker | 160 | 1.9% |
| Value | Count | Frequency (%) |
| finding | 2980 | |
| non-smoker | 2543 | |
| finding);never | 799 | 9.4% |
| smoked | 799 | 9.4% |
| tobacco | 799 | 9.4% |
| smoker | 437 | 5.1% |
| finding);ex-smoker | 160 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
| Value | Count | Frequency (%) |
| (unknown) | 85821 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
| Value | Count | Frequency (%) |
| n | 10421 | 12.1% |
| o | 8080 | 9.4% |
| i | 7878 | 9.2% |
| e | 5537 | 6.5% |
| 5537 | 6.5% | |
| d | 4738 | 5.5% |
| g | 3939 | 4.6% |
| ) | 3939 | 4.6% |
| m | 3939 | 4.6% |
| k | 3939 | 4.6% |
| Other values (15) | 27874 |
smoker_ontology_term_id
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18901 |
| Missing (%) | 86.4% |
| Memory size | 171.1 KiB |
| SNOMED:8392000 | |
|---|---|
| SNOMED:8392000;SNOMED:266919005 | |
| SNOMED:77176002 | |
| SNOMED:8392000;SNOMED:8517006 |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 19.510067 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SNOMED:8392000;SNOMED:266919005 |
|---|---|
| 2nd row | SNOMED:8392000;SNOMED:266919005 |
| 3rd row | SNOMED:8392000;SNOMED:266919005 |
| 4th row | SNOMED:8392000;SNOMED:266919005 |
| 5th row | SNOMED:8392000;SNOMED:266919005 |
Common Values
| Value | Count | Frequency (%) |
| SNOMED:8392000 | 1584 | 7.2% |
| SNOMED:8392000;SNOMED:266919005 | 799 | 3.7% |
| SNOMED:77176002 | 437 | 2.0% |
| SNOMED:8392000;SNOMED:8517006 | 160 | 0.7% |
| (Missing) | 18901 |
Length
| Value | Count | Frequency (%) |
| snomed:8392000 | 1584 | |
| snomed:8392000;snomed:266919005 | 799 | |
| snomed:77176002 | 437 | 14.7% |
| snomed:8392000;snomed:8517006 | 160 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10421 | |
| 9 | 4141 | 7.1% |
| S | 3939 | 6.8% |
| O | 3939 | 6.8% |
| M | 3939 | 6.8% |
| E | 3939 | 6.8% |
| D | 3939 | 6.8% |
| : | 3939 | 6.8% |
| N | 3939 | 6.8% |
| 2 | 3779 | 6.5% |
| Other values (7) | 12226 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58140 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10421 | |
| 9 | 4141 | 7.1% |
| S | 3939 | 6.8% |
| O | 3939 | 6.8% |
| M | 3939 | 6.8% |
| E | 3939 | 6.8% |
| D | 3939 | 6.8% |
| : | 3939 | 6.8% |
| N | 3939 | 6.8% |
| 2 | 3779 | 6.5% |
| Other values (7) | 12226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58140 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10421 | |
| 9 | 4141 | 7.1% |
| S | 3939 | 6.8% |
| O | 3939 | 6.8% |
| M | 3939 | 6.8% |
| E | 3939 | 6.8% |
| D | 3939 | 6.8% |
| : | 3939 | 6.8% |
| N | 3939 | 6.8% |
| 2 | 3779 | 6.5% |
| Other values (7) | 12226 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58140 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10421 | |
| 9 | 4141 | 7.1% |
| S | 3939 | 6.8% |
| O | 3939 | 6.8% |
| M | 3939 | 6.8% |
| E | 3939 | 6.8% |
| D | 3939 | 6.8% |
| : | 3939 | 6.8% |
| N | 3939 | 6.8% |
| 2 | 3779 | 6.5% |
| Other values (7) | 12226 |
control
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 707 |
| Missing (%) | 0.0% | 3.1% |
| Memory size | 171.1 KiB | 176.6 KiB |
| Study Control | |
|---|---|
| Case | |
| Not Used | 25 |
| Study Control | |
|---|---|
| Case | |
| Not Used | 25 |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 13 | 13 |
| Mean length | 10.101092 | 10.101092 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Study Control | Study Control |
| 2nd row | Study Control | Study Control |
| 3rd row | Study Control | Study Control |
| 4th row | Study Control | Study Control |
| 5th row | Study Control | Study Control |
Common Values
| Value | Count | Frequency (%) |
| Study Control | 14822 | |
| Case | 7034 | |
| Not Used | 25 | 0.1% |
| Value | Count | Frequency (%) |
| Study Control | 14822 | |
| Case | 7034 | |
| Not Used | 25 | 0.1% |
| (Missing) | 707 | 3.1% |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| study | 14822 | |
| control | 14822 | |
| case | 7034 | |
| not | 25 | 0.1% |
| used | 25 | 0.1% |
| Value | Count | Frequency (%) |
| study | 14822 | |
| control | 14822 | |
| case | 7034 | |
| not | 25 | 0.1% |
| used | 25 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
| Value | Count | Frequency (%) |
| (unknown) | 221022 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
| Value | Count | Frequency (%) |
| t | 29669 | |
| o | 29669 | |
| C | 21856 | |
| d | 14847 | |
| 14847 | ||
| S | 14822 | |
| u | 14822 | |
| y | 14822 | |
| n | 14822 | |
| r | 14822 | |
| Other values (6) | 36024 |
control_ontology_term_id
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.1 KiB |
| NCIT:C142703 | |
|---|---|
| NCIT:C49152 | |
| NCIT:C69062 | 25 |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.677391 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NCIT:C142703 |
|---|---|
| 2nd row | NCIT:C142703 |
| 3rd row | NCIT:C142703 |
| 4th row | NCIT:C142703 |
| 5th row | NCIT:C142703 |
Common Values
| Value | Count | Frequency (%) |
| NCIT:C142703 | 14822 | |
| NCIT:C49152 | 7034 | |
| NCIT:C69062 | 25 | 0.1% |
Length
| Value | Count | Frequency (%) |
| ncit:c142703 | 14822 | |
| ncit:c49152 | 7034 | |
| ncit:c69062 | 25 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 43762 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 2 | 21881 | |
| 1 | 21856 | |
| 4 | 21856 | |
| 0 | 14847 | 5.8% |
| 7 | 14822 | 5.8% |
| Other values (4) | 28965 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 255513 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 2 | 21881 | |
| 1 | 21856 | |
| 4 | 21856 | |
| 0 | 14847 | 5.8% |
| 7 | 14822 | 5.8% |
| Other values (4) | 28965 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 255513 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 2 | 21881 | |
| 1 | 21856 | |
| 4 | 21856 | |
| 0 | 14847 | 5.8% |
| 7 | 14822 | 5.8% |
| Other values (4) | 28965 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 255513 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 43762 | |
| N | 21881 | |
| I | 21881 | |
| T | 21881 | |
| : | 21881 | |
| 2 | 21881 | |
| 1 | 21856 | |
| 4 | 21856 | |
| 0 | 14847 | 5.8% |
| 7 | 14822 | 5.8% |
| Other values (4) | 28965 |
target_condition
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 47 | 47 |
| Distinct (%) | 0.2% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
| human gut microbiome | |
|---|---|
| Inflammatory Bowel Disease | |
| abnormal glucose tolerance;Metabolic Syndrome;control;Type 2 Diabetes Mellitus;Heart Failure | |
| human microbiome | |
| otitis;pneumonia;bronchitis;Respiratory tract infection;sepsis;Skin Infection;Cough;gastroenteritis;Tonsillitis;pyelonephritis;cystitis;Fever;Infection;stomatitis;salmonellosis | 785 |
| Other values (42) |
| human gut microbiome | |
|---|---|
| Inflammatory Bowel Disease | |
| abnormal glucose tolerance;Metabolic Syndrome;control;Type 2 Diabetes Mellitus;Heart Failure | |
| human microbiome | |
| otitis;pneumonia;bronchitis;Respiratory tract infection;sepsis;Skin Infection;Cough;gastroenteritis;Tonsillitis;pyelonephritis;cystitis;Fever;Infection;stomatitis;salmonellosis | 785 |
| Other values (42) |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 176 | 176 |
| Median length | 92 | 92 |
| Mean length | 35.062246 | 34.593412 |
| Min length | 8 | 8 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | human gut microbiome | human gut microbiome |
| 2nd row | human gut microbiome | human gut microbiome |
| 3rd row | human gut microbiome | human gut microbiome |
| 4th row | human gut microbiome | human gut microbiome |
| 5th row | human gut microbiome | human gut microbiome |
Common Values
| Value | Count | Frequency (%) |
| human gut microbiome | 8306 | |
| Inflammatory Bowel Disease | 2282 | 10.4% |
| abnormal glucose tolerance;Metabolic Syndrome;control;Type 2 Diabetes Mellitus;Heart Failure | 1831 | 8.4% |
| human microbiome | 860 | 3.9% |
| otitis;pneumonia;bronchitis;Respiratory tract infection;sepsis;Skin Infection;Cough;gastroenteritis;Tonsillitis;pyelonephritis;cystitis;Fever;Infection;stomatitis;salmonellosis | 785 | 3.6% |
| colorectal cancer;Adenoma | 616 | 2.8% |
| colorectal cancer | 503 | 2.3% |
| premature birth | 453 | 2.1% |
| abnormal glucose tolerance;Type 2 Diabetes Mellitus | 441 | 2.0% |
| Type 2 Diabetes Mellitus | 400 | 1.8% |
| Other values (37) | 5404 |
| Value | Count | Frequency (%) |
| human gut microbiome | 8306 | |
| Inflammatory Bowel Disease | 2637 | 11.7% |
| abnormal glucose tolerance;Metabolic Syndrome;control;Type 2 Diabetes Mellitus;Heart Failure | 1831 | 8.1% |
| human microbiome | 860 | 3.8% |
| otitis;pneumonia;bronchitis;Respiratory tract infection;sepsis;Skin Infection;Cough;gastroenteritis;Tonsillitis;pyelonephritis;cystitis;Fever;Infection;stomatitis;salmonellosis | 785 | 3.5% |
| colorectal cancer;Adenoma | 642 | 2.8% |
| colorectal cancer | 503 | 2.2% |
| Schizophrenia | 502 | 2.2% |
| premature birth | 453 | 2.0% |
| abnormal glucose tolerance;Type 2 Diabetes Mellitus | 441 | 2.0% |
| Other values (37) | 5628 |
Length
Common Values (Plot)
curated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)concatenated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| human | 9861 | 13.4% |
| microbiome | 9792 | 13.3% |
| gut | 8710 | 11.8% |
| diabetes | 3365 | 4.6% |
| disease | 3024 | 4.1% |
| 2 | 2932 | 4.0% |
| bowel | 2801 | 3.8% |
| inflammatory | 2376 | 3.2% |
| glucose | 2272 | 3.1% |
| abnormal | 2272 | 3.1% |
| Other values (79) | 26155 |
| Value | Count | Frequency (%) |
| human | 9861 | 13.1% |
| microbiome | 9792 | 13.1% |
| gut | 8710 | 11.6% |
| disease | 3379 | 4.5% |
| diabetes | 3365 | 4.5% |
| bowel | 3156 | 4.2% |
| 2 | 2932 | 3.9% |
| inflammatory | 2731 | 3.6% |
| abnormal | 2272 | 3.0% |
| glucose | 2272 | 3.0% |
| Other values (79) | 26560 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 69579 | 9.1% |
| i | 64933 | 8.5% |
| o | 62085 | 8.1% |
| 51679 | 6.7% | |
| a | 51318 | 6.7% |
| t | 50326 | 6.6% |
| m | 46460 | 6.1% |
| r | 42986 | 5.6% |
| n | 40443 | 5.3% |
| l | 37182 | 4.8% |
| Other values (40) | 250206 |
| Value | Count | Frequency (%) |
| e | 71075 | 9.1% |
| i | 65913 | 8.4% |
| o | 63226 | 8.1% |
| a | 52787 | 6.8% |
| 52442 | 6.7% | |
| t | 50734 | 6.5% |
| m | 47196 | 6.0% |
| r | 43746 | 5.6% |
| n | 41149 | 5.3% |
| l | 37998 | 4.9% |
| Other values (40) | 255130 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 767197 |
| Value | Count | Frequency (%) |
| (unknown) | 781396 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 69579 | 9.1% |
| i | 64933 | 8.5% |
| o | 62085 | 8.1% |
| 51679 | 6.7% | |
| a | 51318 | 6.7% |
| t | 50326 | 6.6% |
| m | 46460 | 6.1% |
| r | 42986 | 5.6% |
| n | 40443 | 5.3% |
| l | 37182 | 4.8% |
| Other values (40) | 250206 |
| Value | Count | Frequency (%) |
| e | 71075 | 9.1% |
| i | 65913 | 8.4% |
| o | 63226 | 8.1% |
| a | 52787 | 6.8% |
| 52442 | 6.7% | |
| t | 50734 | 6.5% |
| m | 47196 | 6.0% |
| r | 43746 | 5.6% |
| n | 41149 | 5.3% |
| l | 37998 | 4.9% |
| Other values (40) | 255130 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 767197 |
| Value | Count | Frequency (%) |
| (unknown) | 781396 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 69579 | 9.1% |
| i | 64933 | 8.5% |
| o | 62085 | 8.1% |
| 51679 | 6.7% | |
| a | 51318 | 6.7% |
| t | 50326 | 6.6% |
| m | 46460 | 6.1% |
| r | 42986 | 5.6% |
| n | 40443 | 5.3% |
| l | 37182 | 4.8% |
| Other values (40) | 250206 |
| Value | Count | Frequency (%) |
| e | 71075 | 9.1% |
| i | 65913 | 8.4% |
| o | 63226 | 8.1% |
| a | 52787 | 6.8% |
| 52442 | 6.7% | |
| t | 50734 | 6.5% |
| m | 47196 | 6.0% |
| r | 43746 | 5.6% |
| n | 41149 | 5.3% |
| l | 37998 | 4.9% |
| Other values (40) | 255130 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 767197 |
| Value | Count | Frequency (%) |
| (unknown) | 781396 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 69579 | 9.1% |
| i | 64933 | 8.5% |
| o | 62085 | 8.1% |
| 51679 | 6.7% | |
| a | 51318 | 6.7% |
| t | 50326 | 6.6% |
| m | 46460 | 6.1% |
| r | 42986 | 5.6% |
| n | 40443 | 5.3% |
| l | 37182 | 4.8% |
| Other values (40) | 250206 |
| Value | Count | Frequency (%) |
| e | 71075 | 9.1% |
| i | 65913 | 8.4% |
| o | 63226 | 8.1% |
| a | 52787 | 6.8% |
| 52442 | 6.7% | |
| t | 50734 | 6.5% |
| m | 47196 | 6.0% |
| r | 43746 | 5.6% |
| n | 41149 | 5.3% |
| l | 37998 | 4.9% |
| Other values (40) | 255130 |
target_condition_ontology_term_id
Categorical
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.1 KiB |
| OHMI:0000020 | |
|---|---|
| NCIT:C3138 | |
| EFO:0002546;NCIT:C84442;EFO:0001461;NCIT:C26747;NCIT:C50577 | |
| OHMI:0000002 | |
| SYMP:0000873;EFO:0003106;EFO:0009661;HP:0011947;MP:0005044;NCIT:C35025;HP:0012735;EFO:1001463;NCIT:C116006;EFO:1001141;EFO:1000025;HP:0001945;NCIT:C128320;EFO:0009688;MONDO:0000827 | 785 |
| Other values (42) |
Length
| Max length | 180 |
|---|---|
| Median length | 59 |
| Mean length | 23.773548 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OHMI:0000020 |
|---|---|
| 2nd row | OHMI:0000020 |
| 3rd row | OHMI:0000020 |
| 4th row | OHMI:0000020 |
| 5th row | OHMI:0000020 |
Common Values
| Value | Count | Frequency (%) |
| OHMI:0000020 | 8306 | |
| NCIT:C3138 | 2282 | 10.4% |
| EFO:0002546;NCIT:C84442;EFO:0001461;NCIT:C26747;NCIT:C50577 | 1831 | 8.4% |
| OHMI:0000002 | 860 | 3.9% |
| SYMP:0000873;EFO:0003106;EFO:0009661;HP:0011947;MP:0005044;NCIT:C35025;HP:0012735;EFO:1001463;NCIT:C116006;EFO:1001141;EFO:1000025;HP:0001945;NCIT:C128320;EFO:0009688;MONDO:0000827 | 785 | 3.6% |
| EFO:0005842;NCIT:C2855 | 616 | 2.8% |
| EFO:0005842 | 503 | 2.3% |
| EFO:0003917 | 453 | 2.1% |
| EFO:0002546;NCIT:C26747 | 441 | 2.0% |
| NCIT:C26747 | 400 | 1.8% |
| Other values (37) | 5404 |
Length
| Value | Count | Frequency (%) |
| ohmi:0000020 | 8306 | |
| ncit:c3138 | 2282 | 10.4% |
| efo:0002546;ncit:c84442;efo:0001461;ncit:c26747;ncit:c50577 | 1831 | 8.4% |
| ohmi:0000002 | 860 | 3.9% |
| symp:0000873;efo:0003106;efo:0009661;hp:0011947;mp:0005044;ncit:c35025;hp:0012735;efo:1001463;ncit:c116006;efo:1001141;efo:1000025;hp:0001945;ncit:c128320;efo:0009688;mondo:0000827 | 785 | 3.6% |
| efo:0005842;ncit:c2855 | 616 | 2.8% |
| efo:0005842 | 503 | 2.3% |
| efo:0003917 | 453 | 2.1% |
| efo:0002546;ncit:c26747 | 441 | 2.0% |
| ncit:c26747 | 400 | 1.8% |
| Other values (37) | 5404 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 125912 | |
| : | 44208 | 8.5% |
| C | 30073 | 5.8% |
| 2 | 27455 | 5.3% |
| O | 25804 | 5.0% |
| I | 25300 | 4.9% |
| 1 | 23865 | 4.6% |
| ; | 22642 | 4.4% |
| 4 | 20578 | 4.0% |
| 5 | 17103 | 3.3% |
| Other values (20) | 157249 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 520189 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 125912 | |
| : | 44208 | 8.5% |
| C | 30073 | 5.8% |
| 2 | 27455 | 5.3% |
| O | 25804 | 5.0% |
| I | 25300 | 4.9% |
| 1 | 23865 | 4.6% |
| ; | 22642 | 4.4% |
| 4 | 20578 | 4.0% |
| 5 | 17103 | 3.3% |
| Other values (20) | 157249 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 520189 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 125912 | |
| : | 44208 | 8.5% |
| C | 30073 | 5.8% |
| 2 | 27455 | 5.3% |
| O | 25804 | 5.0% |
| I | 25300 | 4.9% |
| 1 | 23865 | 4.6% |
| ; | 22642 | 4.4% |
| 4 | 20578 | 4.0% |
| 5 | 17103 | 3.3% |
| Other values (20) | 157249 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 520189 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 125912 | |
| : | 44208 | 8.5% |
| C | 30073 | 5.8% |
| 2 | 27455 | 5.3% |
| O | 25804 | 5.0% |
| I | 25300 | 4.9% |
| 1 | 23865 | 4.6% |
| ; | 22642 | 4.4% |
| 4 | 20578 | 4.0% |
| 5 | 17103 | 3.3% |
| Other values (20) | 157249 |
disease
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 206 | 206 |
| Distinct (%) | 0.9% | 0.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 142 | 142 |
| Median length | 7 | 7 |
| Mean length | 15.542708 | 15.923942 |
| Min length | 5 | 5 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 54 | 53 ? |
| Unique (%) | 0.2% | 0.2% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | Healthy | Healthy |
| 2nd row | Healthy | Healthy |
| 3rd row | Healthy | Healthy |
| 4th row | Healthy | Healthy |
| 5th row | Healthy | Healthy |
| Value | Count | Frequency (%) |
| healthy | 14133 | |
| bowel | 1739 | 4.5% |
| inflammatory | 1736 | 4.5% |
| diabetes | 1397 | 3.6% |
| mellitus | 1319 | 3.4% |
| disease | 1233 | 3.2% |
| 2 | 1206 | 3.1% |
| type | 1018 | 2.6% |
| disease;crohn's | 952 | 2.5% |
| disease;ulcerative | 741 | 1.9% |
| Other values (242) | 13089 |
| Value | Count | Frequency (%) |
| healthy | 14432 | |
| bowel | 2094 | 5.2% |
| inflammatory | 2091 | 5.2% |
| diabetes | 1397 | 3.5% |
| mellitus | 1319 | 3.3% |
| disease | 1233 | 3.0% |
| 2 | 1206 | 3.0% |
| disease;ulcerative | 1096 | 2.7% |
| colitis | 1096 | 2.7% |
| type | 1018 | 2.5% |
| Other values (242) | 13461 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 41089 | |
| a | 34662 | 10.2% |
| l | 29163 | 8.6% |
| t | 29002 | 8.5% |
| y | 19985 | 5.9% |
| h | 18098 | 5.3% |
| i | 17867 | 5.3% |
| 16682 | 4.9% | |
| s | 15896 | 4.7% |
| o | 15682 | 4.6% |
| Other values (44) | 101964 |
| Value | Count | Frequency (%) |
| e | 43297 | |
| a | 36461 | 10.1% |
| l | 30909 | 8.6% |
| t | 30501 | 8.5% |
| y | 20639 | 5.7% |
| i | 19503 | 5.4% |
| h | 18424 | 5.1% |
| 17855 | 5.0% | |
| s | 17069 | 4.7% |
| o | 16854 | 4.7% |
| Other values (44) | 108178 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 340090 |
| Value | Count | Frequency (%) |
| (unknown) | 359690 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 41089 | |
| a | 34662 | 10.2% |
| l | 29163 | 8.6% |
| t | 29002 | 8.5% |
| y | 19985 | 5.9% |
| h | 18098 | 5.3% |
| i | 17867 | 5.3% |
| 16682 | 4.9% | |
| s | 15896 | 4.7% |
| o | 15682 | 4.6% |
| Other values (44) | 101964 |
| Value | Count | Frequency (%) |
| e | 43297 | |
| a | 36461 | 10.1% |
| l | 30909 | 8.6% |
| t | 30501 | 8.5% |
| y | 20639 | 5.7% |
| i | 19503 | 5.4% |
| h | 18424 | 5.1% |
| 17855 | 5.0% | |
| s | 17069 | 4.7% |
| o | 16854 | 4.7% |
| Other values (44) | 108178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 340090 |
| Value | Count | Frequency (%) |
| (unknown) | 359690 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 41089 | |
| a | 34662 | 10.2% |
| l | 29163 | 8.6% |
| t | 29002 | 8.5% |
| y | 19985 | 5.9% |
| h | 18098 | 5.3% |
| i | 17867 | 5.3% |
| 16682 | 4.9% | |
| s | 15896 | 4.7% |
| o | 15682 | 4.6% |
| Other values (44) | 101964 |
| Value | Count | Frequency (%) |
| e | 43297 | |
| a | 36461 | 10.1% |
| l | 30909 | 8.6% |
| t | 30501 | 8.5% |
| y | 20639 | 5.7% |
| i | 19503 | 5.4% |
| h | 18424 | 5.1% |
| 17855 | 5.0% | |
| s | 17069 | 4.7% |
| o | 16854 | 4.7% |
| Other values (44) | 108178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 340090 |
| Value | Count | Frequency (%) |
| (unknown) | 359690 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 41089 | |
| a | 34662 | 10.2% |
| l | 29163 | 8.6% |
| t | 29002 | 8.5% |
| y | 19985 | 5.9% |
| h | 18098 | 5.3% |
| i | 17867 | 5.3% |
| 16682 | 4.9% | |
| s | 15896 | 4.7% |
| o | 15682 | 4.6% |
| Other values (44) | 101964 |
| Value | Count | Frequency (%) |
| e | 43297 | |
| a | 36461 | 10.1% |
| l | 30909 | 8.6% |
| t | 30501 | 8.5% |
| y | 20639 | 5.7% |
| i | 19503 | 5.4% |
| h | 18424 | 5.1% |
| 17855 | 5.0% | |
| s | 17069 | 4.7% |
| o | 16854 | 4.7% |
| Other values (44) | 108178 |
| Distinct | 206 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.1 KiB |
Length
| Max length | 67 |
|---|---|
| Median length | 12 |
| Mean length | 13.987661 |
| Min length | 9 |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | NCIT:C115935 |
|---|---|
| 2nd row | NCIT:C115935 |
| 3rd row | NCIT:C115935 |
| 4th row | NCIT:C115935 |
| 5th row | NCIT:C115935 |
| Value | Count | Frequency (%) |
| ncit:c115935 | 14133 | |
| ncit:c3138;efo:0000384 | 952 | 4.4% |
| ncit:c26747 | 893 | 4.1% |
| ncit:c3138;efo:0000729 | 741 | 3.4% |
| efo:0003917 | 448 | 2.0% |
| efo:0005842 | 442 | 2.0% |
| ncit:c84442 | 365 | 1.7% |
| efo:0002546;ncit:c84442 | 266 | 1.2% |
| efo:0002546 | 265 | 1.2% |
| efo:0003914 | 214 | 1.0% |
| Other values (196) | 3162 | 14.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 38603 | |
| 1 | 33964 | |
| 5 | 32532 | |
| : | 26216 | |
| 0 | 24195 | |
| 3 | 21922 | |
| I | 19725 | 6.4% |
| T | 19695 | 6.4% |
| N | 19511 | 6.4% |
| 9 | 17094 | 5.6% |
| Other values (23) | 52607 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 306064 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 38603 | |
| 1 | 33964 | |
| 5 | 32532 | |
| : | 26216 | |
| 0 | 24195 | |
| 3 | 21922 | |
| I | 19725 | 6.4% |
| T | 19695 | 6.4% |
| N | 19511 | 6.4% |
| 9 | 17094 | 5.6% |
| Other values (23) | 52607 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 306064 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 38603 | |
| 1 | 33964 | |
| 5 | 32532 | |
| : | 26216 | |
| 0 | 24195 | |
| 3 | 21922 | |
| I | 19725 | 6.4% |
| T | 19695 | 6.4% |
| N | 19511 | 6.4% |
| 9 | 17094 | 5.6% |
| Other values (23) | 52607 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 306064 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 38603 | |
| 1 | 33964 | |
| 5 | 32532 | |
| : | 26216 | |
| 0 | 24195 | |
| 3 | 21922 | |
| I | 19725 | 6.4% |
| T | 19695 | 6.4% |
| N | 19511 | 6.4% |
| 9 | 17094 | 5.6% |
| Other values (23) | 52607 |
antibiotics_current_use
Boolean
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 7306 | 7932 |
| Missing (%) | 33.4% | 35.1% |
| Memory size | 42.9 KiB | 44.2 KiB |
| False | |
|---|---|
| True | |
| (Missing) |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 12632 | |
| True | 1943 | 8.9% |
| (Missing) | 7306 |
| Value | Count | Frequency (%) |
| False | 12713 | |
| True | 1943 | 8.6% |
| (Missing) | 7932 |
curated_md_report
concatenated_md_report
treatment
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 947 | 947 |
| Distinct (%) | 40.3% | 40.3% |
| Missing | 19534 | 20241 |
| Missing (%) | 89.3% | 89.6% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 364 | 364 |
| Median length | 259 | 259 |
| Mean length | 85.180656 | 85.180656 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 730 | 730 ? |
| Unique (%) | 31.1% | 31.1% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | second generation antipsychotic | second generation antipsychotic |
| 2nd row | second generation antipsychotic | second generation antipsychotic |
| 3rd row | second generation antipsychotic | second generation antipsychotic |
| 4th row | second generation antipsychotic | second generation antipsychotic |
| 5th row | Dopamine Antagonist | Dopamine Antagonist |
| Value | Count | Frequency (%) |
| antilipidemic | 707 | 5.8% |
| agent;antihypertensive | 607 | 5.0% |
| antihypertensive | 492 | 4.0% |
| receptor | 469 | 3.8% |
| agents;anti-diabetic | 454 | 3.7% |
| inhibitor | 441 | 3.6% |
| pump | 420 | 3.4% |
| antibiotic | 406 | 3.3% |
| ii | 403 | 3.3% |
| enzyme | 377 | 3.1% |
| Other values (591) | 7444 |
| Value | Count | Frequency (%) |
| antilipidemic | 707 | 5.8% |
| agent;antihypertensive | 607 | 5.0% |
| antihypertensive | 492 | 4.0% |
| receptor | 469 | 3.8% |
| agents;anti-diabetic | 454 | 3.7% |
| inhibitor | 441 | 3.6% |
| pump | 420 | 3.4% |
| antibiotic | 406 | 3.3% |
| ii | 403 | 3.3% |
| enzyme | 377 | 3.1% |
| Other values (591) | 7444 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
| Value | Count | Frequency (%) |
| (unknown) | 199919 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
| Value | Count | Frequency (%) |
| i | 24625 | 12.3% |
| n | 20507 | 10.3% |
| t | 19733 | 9.9% |
| e | 17544 | 8.8% |
| 9873 | 4.9% | |
| o | 9208 | 4.6% |
| A | 9195 | 4.6% |
| r | 8798 | 4.4% |
| s | 7648 | 3.8% |
| ; | 7392 | 3.7% |
| Other values (45) | 65396 |
| Distinct | 948 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 16053 |
| Missing (%) | 73.4% |
| Memory size | 171.1 KiB |
Length
| Max length | 198 |
|---|---|
| Median length | 11 |
| Mean length | 25.919355 |
| Min length | 9 |
Unique
| Unique | 730 ? |
|---|---|
| Unique (%) | 12.5% |
Sample
| 1st row | NCIT:C41132 |
|---|---|
| 2nd row | NCIT:C41132 |
| 3rd row | NCIT:C41132 |
| 4th row | NCIT:C41132 |
| 5th row | NCIT:C41132 |
| Value | Count | Frequency (%) |
| ncit:c41132 | 3481 | |
| ncit:c1500 | 87 | 1.5% |
| ncit:c61612 | 84 | 1.4% |
| ncit:c29723 | 68 | 1.2% |
| ncit:c843 | 55 | 0.9% |
| ncit:c41132;ncit:c357 | 52 | 0.9% |
| ncit:c357 | 52 | 0.9% |
| ncit:c1500;ncit:c2363 | 49 | 0.8% |
| chebi:87631 | 47 | 0.8% |
| ncit:c257 | 43 | 0.7% |
| Other values (938) | 1810 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 23010 | |
| : | 13181 | 8.7% |
| 1 | 12065 | 8.0% |
| I | 11673 | 7.7% |
| 0 | 10852 | 7.2% |
| T | 10796 | 7.1% |
| N | 10314 | 6.8% |
| 2 | 9127 | 6.0% |
| 3 | 7824 | 5.2% |
| ; | 7392 | 4.9% |
| Other values (16) | 34824 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 151058 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 23010 | |
| : | 13181 | 8.7% |
| 1 | 12065 | 8.0% |
| I | 11673 | 7.7% |
| 0 | 10852 | 7.2% |
| T | 10796 | 7.1% |
| N | 10314 | 6.8% |
| 2 | 9127 | 6.0% |
| 3 | 7824 | 5.2% |
| ; | 7392 | 4.9% |
| Other values (16) | 34824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 151058 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 23010 | |
| : | 13181 | 8.7% |
| 1 | 12065 | 8.0% |
| I | 11673 | 7.7% |
| 0 | 10852 | 7.2% |
| T | 10796 | 7.1% |
| N | 10314 | 6.8% |
| 2 | 9127 | 6.0% |
| 3 | 7824 | 5.2% |
| ; | 7392 | 4.9% |
| Other values (16) | 34824 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 151058 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 23010 | |
| : | 13181 | 8.7% |
| 1 | 12065 | 8.0% |
| I | 11673 | 7.7% |
| 0 | 10852 | 7.2% |
| T | 10796 | 7.1% |
| N | 10314 | 6.8% |
| 2 | 9127 | 6.0% |
| 3 | 7824 | 5.2% |
| ; | 7392 | 4.9% |
| Other values (16) | 34824 |
tumor_staging_ajcc
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 6 | 6 |
| Distinct (%) | 1.0% | 1.0% |
| Missing | 21252 | 21959 |
| Missing (%) | 97.1% | 97.2% |
| Memory size | 171.1 KiB | 176.6 KiB |
| I | |
|---|---|
| III | |
| 0 | |
| II | |
| IV |
| I | |
|---|---|
| III | |
| 0 | |
| II | |
| IV |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 3 | 3 |
| Mean length | 2.0206677 | 2.0206677 |
| Min length | 1 | 1 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | III | III |
| 2nd row | I | I |
| 3rd row | III | III |
| 4th row | I | I |
| 5th row | II | II |
Common Values
| Value | Count | Frequency (%) |
| I | 161 | 0.7% |
| III | 127 | 0.6% |
| 0 | 113 | 0.5% |
| II | 95 | 0.4% |
| IV | 93 | 0.4% |
| III/IV | 40 | 0.2% |
| (Missing) | 21252 |
| Value | Count | Frequency (%) |
| I | 161 | 0.7% |
| III | 127 | 0.6% |
| 0 | 113 | 0.5% |
| II | 95 | 0.4% |
| IV | 93 | 0.4% |
| III/IV | 40 | 0.2% |
| (Missing) | 21959 |
Length
Common Values (Plot)
curated_md_report
concatenated_md_report
| Value | Count | Frequency (%) |
| i | 161 | |
| iii | 127 | |
| 0 | 113 | |
| ii | 95 | |
| iv | 93 | |
| iii/iv | 40 | 6.4% |
| Value | Count | Frequency (%) |
| i | 161 | |
| iii | 127 | |
| 0 | 113 | |
| ii | 95 | |
| iv | 93 | |
| iii/iv | 40 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
| Value | Count | Frequency (%) |
| (unknown) | 1271 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
| Value | Count | Frequency (%) |
| I | 985 | |
| V | 133 | 10.5% |
| 0 | 113 | 8.9% |
| / | 40 | 3.1% |
tumor_staging_tnm
Categorical
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 24 | 24 |
| Distinct (%) | 9.2% | 9.2% |
| Missing | 21619 | 22326 |
| Missing (%) | 98.8% | 98.8% |
| Memory size | 171.1 KiB | 176.6 KiB |
| t3n0m0 | |
|---|---|
| t1n0m0 | |
| t2n0m0 | |
| t3n1m0 | |
| t3n2m0 | |
| Other values (19) |
| t3n0m0 | |
|---|---|
| t1n0m0 | |
| t2n0m0 | |
| t3n1m0 | |
| t3n2m0 | |
| Other values (19) |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 6 | 6 |
| Mean length | 5.9580153 | 5.9580153 |
| Min length | 4 | 4 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 6 | 6 ? |
| Unique (%) | 2.3% | 2.3% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | t1n0m0 | t1n0m0 |
| 2nd row | t3n0m0 | t3n0m0 |
| 3rd row | t4n0m0 | t4n0m0 |
| 4th row | t3n0m0 | t3n0m0 |
| 5th row | ptis | ptis |
Common Values
| Value | Count | Frequency (%) |
| t3n0m0 | 57 | 0.3% |
| t1n0m0 | 39 | 0.2% |
| t2n0m0 | 36 | 0.2% |
| t3n1m0 | 29 | 0.1% |
| t3n2m0 | 17 | 0.1% |
| t4n1m0 | 14 | 0.1% |
| t3n1m1 | 14 | 0.1% |
| t4n1m1 | 10 | < 0.1% |
| ptis | 7 | < 0.1% |
| t2n1m0 | 6 | < 0.1% |
| Other values (14) | 33 | 0.2% |
| (Missing) | 21619 |
| Value | Count | Frequency (%) |
| t3n0m0 | 57 | 0.3% |
| t1n0m0 | 39 | 0.2% |
| t2n0m0 | 36 | 0.2% |
| t3n1m0 | 29 | 0.1% |
| t3n2m0 | 17 | 0.1% |
| t4n1m0 | 14 | 0.1% |
| t3n1m1 | 14 | 0.1% |
| t4n1m1 | 10 | < 0.1% |
| ptis | 7 | < 0.1% |
| t2n1m0 | 6 | < 0.1% |
| Other values (14) | 33 | 0.1% |
| (Missing) | 22326 |
Length
Common Values (Plot)
curated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)concatenated_md_report
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| t3n0m0 | 57 | |
| t1n0m0 | 39 | |
| t2n0m0 | 36 | |
| t3n1m0 | 29 | |
| t3n2m0 | 17 | 6.5% |
| t4n1m0 | 14 | 5.3% |
| t3n1m1 | 14 | 5.3% |
| t4n1m1 | 10 | 3.8% |
| ptis | 7 | 2.7% |
| t2n1m0 | 6 | 2.3% |
| Other values (14) | 33 |
| Value | Count | Frequency (%) |
| t3n0m0 | 57 | |
| t1n0m0 | 39 | |
| t2n0m0 | 36 | |
| t3n1m0 | 29 | |
| t3n2m0 | 17 | 6.5% |
| t4n1m0 | 14 | 5.3% |
| t3n1m1 | 14 | 5.3% |
| t4n1m1 | 10 | 3.8% |
| ptis | 7 | 2.7% |
| t2n1m0 | 6 | 2.3% |
| Other values (14) | 33 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
| Value | Count | Frequency (%) |
| (unknown) | 1561 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 361 | |
| t | 262 | |
| n | 255 | |
| m | 255 | |
| 1 | 157 | |
| 3 | 129 | 8.3% |
| 2 | 76 | 4.9% |
| 4 | 38 | 2.4% |
| i | 10 | 0.6% |
| s | 10 | 0.6% |
| Other values (2) | 8 | 0.5% |
unmetadata
['Text', 'Text']
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 231 | 231 |
| Distinct (%) | 11.2% | 11.2% |
| Missing | 19810 | 20517 |
| Missing (%) | 90.5% | 90.8% |
| Memory size | 171.1 KiB | 176.6 KiB |
Length
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Max length | 141 | 141 |
| Median length | 78 | 78 |
| Mean length | 78.084017 | 78.084017 |
| Min length | 7 | 7 |
Unique
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Unique | 182 | 182 ? |
| Unique (%) | 8.8% | 8.8% |
Sample
| curated_md_report | concatenated_md_report | |
|---|---|---|
| 1st row | travel_destination:CMR | travel_destination:CMR |
| 2nd row | travel_destination:CMR | travel_destination:CMR |
| 3rd row | travel_destination:CMR | travel_destination:CMR |
| 4th row | travel_destination:CMR | travel_destination:CMR |
| 5th row | travel_destination:CMR | travel_destination:CMR |
| Value | Count | Frequency (%) |
| uncurated_metadata:no_immuno_suppressive;no_t2d;no_t1d;no_related_treatments;no_psychiatric_diseases;no_gastro_intestinal_disorder;non_celiac | 900 | |
| uncurated_metadata:no_infection;no_cancer | 250 | 11.1% |
| fobt:no | 121 | 5.4% |
| uncurated_metadata:low_gluten_diet | 104 | 4.6% |
| uncurated_metadata:high_gluten_diet | 103 | 4.6% |
| uncurated_metadata:no_diabetes;non_celiac;no_gi_diseases | 97 | 4.3% |
| fobt:yes | 64 | 2.8% |
| given | 45 | 2.0% |
| as | 45 | 2.0% |
| 30 | 45 | 2.0% |
| Other values (225) | 477 |
| Value | Count | Frequency (%) |
| uncurated_metadata:no_immuno_suppressive;no_t2d;no_t1d;no_related_treatments;no_psychiatric_diseases;no_gastro_intestinal_disorder;non_celiac | 900 | |
| uncurated_metadata:no_infection;no_cancer | 250 | 11.1% |
| fobt:no | 121 | 5.4% |
| uncurated_metadata:low_gluten_diet | 104 | 4.6% |
| uncurated_metadata:high_gluten_diet | 103 | 4.6% |
| uncurated_metadata:no_diabetes;non_celiac;no_gi_diseases | 97 | 4.3% |
| fobt:yes | 64 | 2.8% |
| given | 45 | 2.0% |
| as | 45 | 2.0% |
| 30 | 45 | 2.0% |
| Other values (225) | 477 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
| Value | Count | Frequency (%) |
| (unknown) | 161712 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
| Value | Count | Frequency (%) |
| e | 15081 | 9.3% |
| n | 14915 | 9.2% |
| _ | 13971 | 8.6% |
| a | 13463 | 8.3% |
| t | 13088 | 8.1% |
| o | 10878 | 6.7% |
| s | 10718 | 6.6% |
| i | 9749 | 6.0% |
| r | 8396 | 5.2% |
| d | 7135 | 4.4% |
| Other values (47) | 44318 |
westernized
Boolean
| curated_md_report | concatenated_md_report | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 21.5 KiB | 22.2 KiB |
| True | |
|---|---|
| False | 1255 |
| True | |
|---|---|
| False | 1255 |
| Value | Count | Frequency (%) |
| True | 20626 | |
| False | 1255 | 5.7% |
| Value | Count | Frequency (%) |
| True | 21333 | |
| False | 1255 | 5.6% |
curated_md_report
concatenated_md_report
Interactions
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
Correlations
curated_md_report
concatenated_md_report
curated_md_report
| age_group | age_group_ontology_term_id | age_max | age_min | age_years | antibiotics_current_use | body_site | body_site_ontology_term_id | control | control_ontology_term_id | country | country_ontology_term_id | dietary_restriction | feces_phenotype_metric | feces_phenotype_metric_ontology_term_id | fmt_id | fmt_role | hla | hla_ontology_term_id | sex | sex_ontology_term_id | smoker | smoker_ontology_term_id | target_condition | target_condition_ontology_term_id | tumor_staging_ajcc | tumor_staging_tnm | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age_group | 1.000 | 1.000 | 0.711 | 0.712 | 0.844 | 0.147 | 0.099 | 0.099 | 0.175 | 0.175 | 0.431 | 0.431 | 0.332 | 0.415 | 0.415 | 0.771 | 0.103 | 0.145 | 0.145 | 0.060 | 0.060 | 0.058 | 0.058 | 0.496 | 0.496 | 0.091 | 0.146 | 0.150 |
| age_group_ontology_term_id | 1.000 | 1.000 | 0.711 | 0.712 | 0.844 | 0.147 | 0.099 | 0.099 | 0.175 | 0.175 | 0.431 | 0.431 | 0.332 | 0.415 | 0.415 | 0.771 | 0.103 | 0.145 | 0.145 | 0.060 | 0.060 | 0.058 | 0.058 | 0.496 | 0.496 | 0.091 | 0.146 | 0.150 |
| age_max | 0.711 | 0.711 | 1.000 | 0.496 | 1.000 | 0.185 | 0.157 | 0.157 | 0.155 | 0.155 | 0.355 | 0.355 | 0.457 | 0.480 | 0.480 | 0.771 | 0.103 | 1.000 | 1.000 | 0.082 | 0.082 | 0.234 | 0.234 | 0.428 | 0.428 | 0.153 | 0.146 | 0.166 |
| age_min | 0.712 | 0.712 | 0.496 | 1.000 | 1.000 | 0.222 | 0.173 | 0.173 | 0.168 | 0.168 | 0.336 | 0.336 | 0.477 | 0.331 | 0.331 | 0.790 | 0.054 | 1.000 | 1.000 | 0.098 | 0.098 | 0.238 | 0.238 | 0.418 | 0.418 | 0.178 | 0.161 | 0.174 |
| age_years | 0.844 | 0.844 | 1.000 | 1.000 | 1.000 | 0.287 | 0.149 | 0.149 | 0.285 | 0.285 | 0.315 | 0.315 | 0.472 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.111 | 0.111 | 0.416 | 0.416 | 0.416 | 0.416 | 0.091 | 0.161 | 0.134 |
| antibiotics_current_use | 0.147 | 0.147 | 0.185 | 0.222 | 0.287 | 1.000 | 0.110 | 0.110 | 0.196 | 0.196 | 0.458 | 0.458 | 0.000 | 0.295 | 0.295 | 0.812 | 0.400 | 0.159 | 0.159 | 0.011 | 0.011 | 0.384 | 0.384 | 0.530 | 0.530 | 0.064 | 1.000 | 0.061 |
| body_site | 0.099 | 0.099 | 0.157 | 0.173 | 0.149 | 0.110 | 1.000 | 1.000 | 0.148 | 0.148 | 0.289 | 0.289 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.098 | 0.098 | 0.289 | 0.289 | 0.419 | 0.419 | 1.000 | 1.000 | 0.304 |
| body_site_ontology_term_id | 0.099 | 0.099 | 0.157 | 0.173 | 0.149 | 0.110 | 1.000 | 1.000 | 0.148 | 0.148 | 0.289 | 0.289 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.098 | 0.098 | 0.289 | 0.289 | 0.419 | 0.419 | 1.000 | 1.000 | 0.304 |
| control | 0.175 | 0.175 | 0.155 | 0.168 | 0.285 | 0.196 | 0.148 | 0.148 | 1.000 | 1.000 | 0.347 | 0.347 | 0.385 | 0.414 | 0.414 | 0.836 | 0.473 | 0.355 | 0.355 | 0.055 | 0.055 | 0.358 | 0.358 | 0.533 | 0.533 | 0.527 | 1.000 | 0.124 |
| control_ontology_term_id | 0.175 | 0.175 | 0.155 | 0.168 | 0.285 | 0.196 | 0.148 | 0.148 | 1.000 | 1.000 | 0.347 | 0.347 | 0.385 | 0.414 | 0.414 | 0.836 | 0.473 | 0.355 | 0.355 | 0.055 | 0.055 | 0.358 | 0.358 | 0.533 | 0.533 | 0.527 | 1.000 | 0.124 |
| country | 0.431 | 0.431 | 0.355 | 0.336 | 0.315 | 0.458 | 0.289 | 0.289 | 0.347 | 0.347 | 1.000 | 1.000 | 0.545 | 0.787 | 0.787 | 1.000 | 1.000 | 0.442 | 0.442 | 0.195 | 0.195 | 0.543 | 0.543 | 0.552 | 0.552 | 0.243 | 0.365 | 0.976 |
| country_ontology_term_id | 0.431 | 0.431 | 0.355 | 0.336 | 0.315 | 0.458 | 0.289 | 0.289 | 0.347 | 0.347 | 1.000 | 1.000 | 0.545 | 0.787 | 0.787 | 1.000 | 1.000 | 0.442 | 0.442 | 0.195 | 0.195 | 0.543 | 0.543 | 0.552 | 0.552 | 0.243 | 0.365 | 0.976 |
| dietary_restriction | 0.332 | 0.332 | 0.457 | 0.477 | 0.472 | 0.000 | 1.000 | 1.000 | 0.385 | 0.385 | 0.545 | 0.545 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.146 | 0.146 | 1.000 | 1.000 | 0.545 | 0.545 | 1.000 | 0.000 | 1.000 |
| feces_phenotype_metric | 0.415 | 0.415 | 0.480 | 0.331 | 1.000 | 0.295 | 1.000 | 1.000 | 0.414 | 0.414 | 0.787 | 0.787 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.788 | 0.788 | 0.000 | 0.000 | 1.000 |
| feces_phenotype_metric_ontology_term_id | 0.415 | 0.415 | 0.480 | 0.331 | 1.000 | 0.295 | 1.000 | 1.000 | 0.414 | 0.414 | 0.787 | 0.787 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.788 | 0.788 | 0.000 | 0.000 | 1.000 |
| fmt_id | 0.771 | 0.771 | 0.771 | 0.790 | 0.000 | 0.812 | 1.000 | 1.000 | 0.836 | 0.836 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.340 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 |
| fmt_role | 0.103 | 0.103 | 0.103 | 0.054 | 0.000 | 0.400 | 1.000 | 1.000 | 0.473 | 0.473 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.340 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 |
| hla | 0.145 | 0.145 | 1.000 | 1.000 | 1.000 | 0.159 | 1.000 | 1.000 | 0.355 | 0.355 | 0.442 | 0.442 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.515 | 0.515 | 0.000 | 0.000 | 0.981 | 0.981 | 0.000 | 0.000 | 1.000 |
| hla_ontology_term_id | 0.145 | 0.145 | 1.000 | 1.000 | 1.000 | 0.159 | 1.000 | 1.000 | 0.355 | 0.355 | 0.442 | 0.442 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.515 | 0.515 | 0.000 | 0.000 | 0.981 | 0.981 | 0.000 | 0.000 | 1.000 |
| sex | 0.060 | 0.060 | 0.082 | 0.098 | 0.111 | 0.011 | 0.098 | 0.098 | 0.055 | 0.055 | 0.195 | 0.195 | 0.146 | 0.000 | 0.000 | 0.000 | 0.000 | 0.515 | 0.515 | 1.000 | 1.000 | 0.157 | 0.157 | 0.182 | 0.182 | 0.000 | 0.000 | 0.003 |
| sex_ontology_term_id | 0.060 | 0.060 | 0.082 | 0.098 | 0.111 | 0.011 | 0.098 | 0.098 | 0.055 | 0.055 | 0.195 | 0.195 | 0.146 | 0.000 | 0.000 | 0.000 | 0.000 | 0.515 | 0.515 | 1.000 | 1.000 | 0.157 | 0.157 | 0.182 | 0.182 | 0.000 | 0.000 | 0.003 |
| smoker | 0.058 | 0.058 | 0.234 | 0.238 | 0.416 | 0.384 | 0.289 | 0.289 | 0.358 | 0.358 | 0.543 | 0.543 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.157 | 0.157 | 1.000 | 1.000 | 0.560 | 0.560 | 0.000 | 0.000 | 0.091 |
| smoker_ontology_term_id | 0.058 | 0.058 | 0.234 | 0.238 | 0.416 | 0.384 | 0.289 | 0.289 | 0.358 | 0.358 | 0.543 | 0.543 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.157 | 0.157 | 1.000 | 1.000 | 0.560 | 0.560 | 0.000 | 0.000 | 0.091 |
| target_condition | 0.496 | 0.496 | 0.428 | 0.418 | 0.416 | 0.530 | 0.419 | 0.419 | 0.533 | 0.533 | 0.552 | 0.552 | 0.545 | 0.788 | 0.788 | 1.000 | 1.000 | 0.981 | 0.981 | 0.182 | 0.182 | 0.560 | 0.560 | 1.000 | 1.000 | 0.315 | 0.422 | 0.712 |
| target_condition_ontology_term_id | 0.496 | 0.496 | 0.428 | 0.418 | 0.416 | 0.530 | 0.419 | 0.419 | 0.533 | 0.533 | 0.552 | 0.552 | 0.545 | 0.788 | 0.788 | 1.000 | 1.000 | 0.981 | 0.981 | 0.182 | 0.182 | 0.560 | 0.560 | 1.000 | 1.000 | 0.315 | 0.422 | 0.712 |
| tumor_staging_ajcc | 0.091 | 0.091 | 0.153 | 0.178 | 0.091 | 0.064 | 1.000 | 1.000 | 0.527 | 0.527 | 0.243 | 0.243 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.315 | 0.315 | 1.000 | 0.943 | 1.000 |
| tumor_staging_tnm | 0.146 | 0.146 | 0.146 | 0.161 | 0.161 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.365 | 0.365 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.422 | 0.422 | 0.943 | 1.000 | 1.000 |
| westernized | 0.150 | 0.150 | 0.166 | 0.174 | 0.134 | 0.061 | 0.304 | 0.304 | 0.124 | 0.124 | 0.976 | 0.976 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.003 | 0.003 | 0.091 | 0.091 | 0.712 | 0.712 | 1.000 | 1.000 | 1.000 |
concatenated_md_report
| age_group | age_max | age_min | age_years | antibiotics_current_use | body_site | control | country | dietary_restriction | feces_phenotype_metric | fmt_id | sex | smoker | target_condition | tumor_staging_ajcc | tumor_staging_tnm | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age_group | 1.000 | 0.711 | 0.712 | 0.844 | 0.146 | 0.097 | 0.175 | 0.428 | 0.332 | 0.415 | 0.771 | 0.065 | 0.058 | 0.485 | 0.091 | 0.146 | 0.151 |
| age_max | 0.711 | 1.000 | 0.496 | 1.000 | 0.184 | 0.157 | 0.155 | 0.354 | 0.457 | 0.480 | 0.771 | 0.082 | 0.234 | 0.426 | 0.153 | 0.146 | 0.166 |
| age_min | 0.712 | 0.496 | 1.000 | 1.000 | 0.222 | 0.173 | 0.168 | 0.335 | 0.477 | 0.331 | 0.790 | 0.098 | 0.238 | 0.417 | 0.178 | 0.161 | 0.174 |
| age_years | 0.844 | 1.000 | 1.000 | 1.000 | 0.287 | 0.149 | 0.285 | 0.313 | 0.472 | 1.000 | 0.000 | 0.110 | 0.416 | 0.414 | 0.091 | 0.161 | 0.134 |
| antibiotics_current_use | 0.146 | 0.184 | 0.222 | 0.287 | 1.000 | 0.109 | 0.196 | 0.449 | 0.000 | 0.295 | 0.812 | 0.010 | 0.384 | 0.531 | 0.064 | 1.000 | 0.061 |
| body_site | 0.097 | 0.157 | 0.173 | 0.149 | 0.109 | 1.000 | 0.148 | 0.279 | 1.000 | 1.000 | 1.000 | 0.095 | 0.289 | 0.419 | 1.000 | 1.000 | 0.303 |
| control | 0.175 | 0.155 | 0.168 | 0.285 | 0.196 | 0.148 | 1.000 | 0.347 | 0.385 | 0.414 | 0.836 | 0.055 | 0.358 | 0.533 | 0.527 | 1.000 | 0.124 |
| country | 0.428 | 0.354 | 0.335 | 0.313 | 0.449 | 0.279 | 0.347 | 1.000 | 0.545 | 0.787 | 1.000 | 0.199 | 0.543 | 0.541 | 0.243 | 0.365 | 0.911 |
| dietary_restriction | 0.332 | 0.457 | 0.477 | 0.472 | 0.000 | 1.000 | 0.385 | 0.545 | 1.000 | 0.000 | 0.000 | 0.146 | 1.000 | 0.545 | 1.000 | 0.000 | 1.000 |
| feces_phenotype_metric | 0.415 | 0.480 | 0.331 | 1.000 | 0.295 | 1.000 | 0.414 | 0.787 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.788 | 0.000 | 0.000 | 1.000 |
| fmt_id | 0.771 | 0.771 | 0.790 | 0.000 | 0.812 | 1.000 | 0.836 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 |
| sex | 0.065 | 0.082 | 0.098 | 0.110 | 0.010 | 0.095 | 0.055 | 0.199 | 0.146 | 0.000 | 0.000 | 1.000 | 0.157 | 0.184 | 0.000 | 0.000 | 0.000 |
| smoker | 0.058 | 0.234 | 0.238 | 0.416 | 0.384 | 0.289 | 0.358 | 0.543 | 1.000 | 1.000 | 0.000 | 0.157 | 1.000 | 0.560 | 0.000 | 0.000 | 0.091 |
| target_condition | 0.485 | 0.426 | 0.417 | 0.414 | 0.531 | 0.419 | 0.533 | 0.541 | 0.545 | 0.788 | 1.000 | 0.184 | 0.560 | 1.000 | 0.315 | 0.422 | 0.707 |
| tumor_staging_ajcc | 0.091 | 0.153 | 0.178 | 0.091 | 0.064 | 1.000 | 0.527 | 0.243 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.315 | 1.000 | 0.943 | 1.000 |
| tumor_staging_tnm | 0.146 | 0.146 | 0.161 | 0.161 | 1.000 | 1.000 | 1.000 | 0.365 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.422 | 0.943 | 1.000 | 1.000 |
| westernized | 0.151 | 0.166 | 0.174 | 0.134 | 0.061 | 0.303 | 0.124 | 0.911 | 1.000 | 1.000 | 1.000 | 0.000 | 0.091 | 0.707 | 1.000 | 1.000 | 1.000 |
Missing values
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
curated_md_report
concatenated_md_report
Sample
curated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | age_group_ontology_term_id | biomarker | body_site | body_site_ontology_term_id | country | country_ontology_term_id | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | feces_phenotype_metric_ontology_term_id | fmt_role | fmt_id | sex | sex_ontology_term_id | hla | hla_ontology_term_id | smoker | smoker_ontology_term_id | control | control_ontology_term_id | target_condition | target_condition_ontology_term_id | disease | disease_ontology_term_id | antibiotics_current_use | treatment | treatment_ontology_term_id | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AsnicarF_2017 | MV_FEI1_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 1 | AsnicarF_2017 | MV_FEI2_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 2 | AsnicarF_2017 | MV_FEI3_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 3 | AsnicarF_2017 | MV_FEI4_t1Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 4 | AsnicarF_2017 | MV_FEI4_t2Q15 | 1.000000 | 1.000000 | 1.000000 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 5 | AsnicarF_2017 | MV_FEI5_t1Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 6 | AsnicarF_2017 | MV_FEI5_t2Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 7 | AsnicarF_2017 | MV_FEI5_t3Q15 | 1.000000 | 1.000000 | 1.000000 | Infant | NCIT:C27956 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 8 | AsnicarF_2017 | MV_FEM1_t1Q14 | NaN | 18.000000 | 65.000000 | Adult | NCIT:C49685 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
| 9 | AsnicarF_2017 | MV_FEM2_t1Q14 | NaN | 18.000000 | 65.000000 | Adult | NCIT:C49685 | NaN | feces | UBERON:0001988 | Italy | NCIT:C16761 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | NaN | NaN | Study Control | NCIT:C142703 | human gut microbiome | OHMI:0000020 | Healthy | NCIT:C115935 | NaN | NaN | NaN | NaN | NaN | NaN | Yes |
concatenated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | biomarker | body_site | country | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | fmt_id | sex | smoker | control | target_condition | disease | antibiotics_current_use | treatment | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AsnicarF_2017 | MV_FEI1_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Female | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 1 | AsnicarF_2017 | MV_FEI2_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 2 | AsnicarF_2017 | MV_FEI3_t1Q14 | 0.246575 | 0.246575 | 0.246575 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 3 | AsnicarF_2017 | MV_FEI4_t1Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 4 | AsnicarF_2017 | MV_FEI4_t2Q15 | 1.000000 | 1.000000 | 1.000000 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 5 | AsnicarF_2017 | MV_FEI5_t1Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 6 | AsnicarF_2017 | MV_FEI5_t2Q14 | 1.000000 | 1.000000 | 1.000000 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 7 | AsnicarF_2017 | MV_FEI5_t3Q15 | 1.000000 | 1.000000 | 1.000000 | Infant | NaN | feces | Italy | NaN | NaN | NaN | NaN | Male | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 8 | AsnicarF_2017 | MV_FEM1_t1Q14 | NaN | 18.000000 | 65.000000 | Adult | NaN | feces | Italy | NaN | NaN | NaN | NaN | Female | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 9 | AsnicarF_2017 | MV_FEM2_t1Q14 | NaN | 18.000000 | 65.000000 | Adult | NaN | feces | Italy | NaN | NaN | NaN | NaN | Female | NaN | Study Control | human gut microbiome | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
curated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | age_group_ontology_term_id | biomarker | body_site | body_site_ontology_term_id | country | country_ontology_term_id | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | feces_phenotype_metric_ontology_term_id | fmt_role | fmt_id | sex | sex_ontology_term_id | hla | hla_ontology_term_id | smoker | smoker_ontology_term_id | control | control_ontology_term_id | target_condition | target_condition_ontology_term_id | disease | disease_ontology_term_id | antibiotics_current_use | treatment | treatment_ontology_term_id | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 21871 | ZhuF_2020 | wHAXPI034926-15 | 22.0 | 22.0 | 22.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:70;Systolic_Blood_Pressure_in_mm/Hg:112 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Study Control | NCIT:C142703 | Schizophrenia | NCIT:C3362 | Healthy | NCIT:C115935 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21872 | ZhuF_2020 | wHAXPI037144-8 | 19.0 | 19.0 | 19.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:74;Systolic_Blood_Pressure_in_mm/Hg:107 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia | NCIT:C3362 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21873 | ZhuF_2020 | wHAXPI037145-9 | 17.0 | 17.0 | 17.0 | Adolescent | NCIT:C27954 | Diastolic_Blood_Pressure_in_mm/Hg:79;Systolic_Blood_Pressure_in_mm/Hg:137 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia | NCIT:C3362 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21874 | ZhuF_2020 | wHAXPI037146-11 | 20.0 | 20.0 | 20.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:80;Systolic_Blood_Pressure_in_mm/Hg:120 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia | NCIT:C3362 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21875 | ZhuF_2020 | wHAXPI037147-12 | 17.0 | 17.0 | 17.0 | Adolescent | NCIT:C27954 | Diastolic_Blood_Pressure_in_mm/Hg:85;Systolic_Blood_Pressure_in_mm/Hg:115 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia | NCIT:C3362 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21876 | ZhuF_2020 | wHAXPI043592-8 | 37.0 | 37.0 | 37.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:81;Systolic_Blood_Pressure_in_mm/Hg:120 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia;Schizophrenia,repeated | NCIT:C3362;EUPATH:0001011 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21877 | ZhuF_2020 | wHAXPI043593-9 | 40.0 | 40.0 | 40.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:78;Systolic_Blood_Pressure_in_mm/Hg:117 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | Smoker (finding) | SNOMED:77176002 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia;Schizophrenia,repeated | NCIT:C3362;EUPATH:0001011 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21878 | ZhuF_2020 | wHAXPI043594-11 | 25.0 | 25.0 | 25.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:83;Systolic_Blood_Pressure_in_mm/Hg:125 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Male | NCIT:C20197 | NaN | NaN | Smoker (finding) | SNOMED:77176002 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia;Schizophrenia,repeated | NCIT:C3362;EUPATH:0001011 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21879 | ZhuF_2020 | wHAXPI047830-11 | 39.0 | 39.0 | 39.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:80;Systolic_Blood_Pressure_in_mm/Hg:120 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia;Schizophrenia,repeated | NCIT:C3362;EUPATH:0001011 | no | NaN | NaN | NaN | NaN | NaN | Yes |
| 21880 | ZhuF_2020 | wHAXPI048670-90 | 38.0 | 38.0 | 38.0 | Adult | NCIT:C49685 | Diastolic_Blood_Pressure_in_mm/Hg:80;Systolic_Blood_Pressure_in_mm/Hg:120 | feces | UBERON:0001988 | China | NCIT:C16428 | NaN | NaN | NaN | NaN | NaN | NaN | Female | NCIT:C16576 | NaN | NaN | Non-smoker (finding) | SNOMED:8392000 | Case | NCIT:C49152 | Schizophrenia | NCIT:C3362 | Schizophrenia;Schizophrenia,repeated | NCIT:C3362;EUPATH:0001011 | no | NaN | NaN | NaN | NaN | NaN | Yes |
concatenated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | biomarker | body_site | country | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | fmt_id | sex | smoker | control | target_condition | disease | antibiotics_current_use | treatment | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 22578 | YassourM_2018 | G102213 | NaN | NaN | NaN | Adult | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Female | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22579 | YassourM_2018 | G104686 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22580 | YassourM_2018 | G102217 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22581 | YassourM_2018 | G102218 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22582 | YassourM_2018 | G102211 | NaN | NaN | NaN | Adult | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Female | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22583 | YassourM_2018 | G102212 | NaN | NaN | NaN | Adult | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Female | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22584 | YassourM_2018 | G104681 | NaN | NaN | NaN | Adult | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Female | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22585 | YassourM_2018 | G102214 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22586 | YassourM_2018 | G102215 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
| 22587 | YassourM_2018 | G102216 | NaN | NaN | NaN | Infant | NaN | feces | Fiji | NaN | NaN | NaN | NaN | Male | NaN | NaN | Schizophrenia | Healthy | NaN | NaN | NaN | NaN | NaN | Yes |
Duplicate rows
curated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | age_group_ontology_term_id | biomarker | body_site | body_site_ontology_term_id | country | country_ontology_term_id | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | feces_phenotype_metric_ontology_term_id | fmt_role | fmt_id | sex | sex_ontology_term_id | hla | hla_ontology_term_id | smoker | smoker_ontology_term_id | control | control_ontology_term_id | target_condition | target_condition_ontology_term_id | disease | disease_ontology_term_id | antibiotics_current_use | treatment | treatment_ontology_term_id | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | ||||||||||||||||||||||||||||||||||||||
concatenated_md_report
| study_name | sample_id | age_years | age_min | age_max | age_group | biomarker | body_site | country | dietary_restriction | feces_phenotype_metric | feces_phenotype_value | fmt_id | sex | smoker | control | target_condition | disease | antibiotics_current_use | treatment | tumor_staging_ajcc | tumor_staging_tnm | unmetadata | westernized | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | |||||||||||||||||||||||||